Number Theory 


Introduction 


Vision 


The goal of this open-source number theory textbook is to gather up all the core subfields of 
number theory into one text. By making it open-source, everyone will be able to contribute 
in terms of adding new material and improving existing material, and tailor it to their own 
learning or teaching. 

It is an era of mass collaboration, in mathematics and many other fields. I aim to follow 
the example of other successful online textbooks such as CRing and the Stacks Project. 
Because we all are good at different subfields, we will be able to achieve much more if each 
person writes notes on an area they have studied well. For instance, I have found that one 
of the best ways to learn a semi-advanced topic is to find an undergraduate thesis on it: 
because the author has spent a long time thinking on the subject, had long conversations 
with advisors, and taken time to lay out the big motivations and deeper connections. 

The philosophy behind this textbook is the following. If you’d like to contribute to this 
work I’d recommend following these guidelines. 


1. Take a problem-oriented approach. In other words, give motivation behind abstract 
theory by relating them to interesting, concrete problems. 


2. Create a user-friendly learning resource: Start each chapter by telling the reader 
why the material matters, what problems in number theory it solves, and how it fits 
into the big picture.! For instance, class field theory gives a way to understand field 
extensions through information intrinsic to the field, it gives framework for reciprocity 
laws (among many other things, see chapter 229), and it is part of the larger Langlands 
program. Give a summary of the takeaway ideas after the chapter, and exercises that 
help the reader conceptually grasp the material. Always motivate proofs, especially 
long hard ones, and tell the reader why the big theorems matter. Make connections 
between different ways of approaching a particular problems. Highlight recurring tech- 
niques. 


(These points have not all been implemented in the chapters here, but it is what I’m 
shooting for.) 


'T learned this style from Patrick Winston. You can see it in action in his textbook on Artificial Intelli- 
gence. 


3. Be a self-contained work: This means that proofs should refer to theorems in the 
text itself, possibly from a previous part (for instance, refer to the Algebraic Number 
Theory section in the Class Field Theory section.) (I may eventually move some of 
the material into appendices, such as the complex analysis background required for 
analytic number theory.) 


4. Connect up elementary with advanced number theory, and offer a road map of the 
subject. 


Some more notes: 


1. All material is under a creative commons license. 


2. This will be posted open-source on my website, http: //web.mit.edu/~holdenl/ 
www/math/notes.htm 


3. Please contribute! You will be credited. It doesn’t have to be finished/polished stuff— 
after all, it takes much less time to edit material that’s already written than to write 
it myself. 


4. Much of this is in very rough shape right now. 
5. Email suggestions and corrections to holdenl@mit.edu. 


6. (*) denotes optional material. (}) denotes theorems that will be made obsolete by 
stronger theorems later (for example, the p = 1 (mod n) case of Dirichlet’s theorem), 
and so can be skipped. 


This file was last updated June 12, 2014. 
Contributors: Holden Lee, Oleg Muskarov, Teo Andrica 
Thanks for proofreading: Delong Meng, Timo Keller 
For a list of references, see refs.bib. 
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Part I 


Elementary Number Theory 


Chapter 1 


Factorization and Divisibility 


1 Divisibility 
1.1. The division algorithm 


2 Primes 
3 LCM and GCD 


3.1 The Euclidean algorithm 
4 Look at the exponent 


Theorem 1.4.1: We have that 


wi (3]- lla 


k=1 


Given that n = 7)_, agp*, find 


ord,(n!) = 


Proof. 


Example 1.4.2 (AMC ??): Let x and y be positive integers such that 77° = 1ly!®. The 
minimum possible value of « can be written in the form a°b¢ where a,b,c,d are positive 
integers. Compute a+b+c+d. 
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Chapter 2 


Modular Arithmetic 


1 Modular Arithmetic 


Let a,b be integers and let m be a positive integer. We say that a and 6} are congruent 
modulo m if m divides a — b. This is denoted as a= 6b mod m. If m does not divide a — b, 
then we write a # b modm. The relation a = 6 for integers a,b has many of the same 
properties as the relation a = b. 


Proposition 2.1.1: The following properties hold for integers a,b,c and positive integers 
mM. 


1.a=a modm; 

2. Ifa=b mod™m, then b=a mod m;; 

3. fa=b modmandb=c modm, thena=c mod ™; 

4. Ifa; =b; mod m for 1 <i<n, then aj + a2 +---+a, =b; +bo+---+6, mod m; 
5. Ifa+b=c modm, thena=c—b mod Mm; 

6. Ifa=b modm, thena+c=b+c mod m; 

7. Ifa; =b; mod ™m, then ajag--- an = bib2---b, mod m:; 

8. Ifa=b mod™m, then ac = bec mod m; 

9. Ifa=b mod ™m, then a” = b" mod m for all positive integers n; 


10. If a=b mod ™m and f(z) is a polynomial with integer coefficients, then f(a) = f(b) 
mod m. 


Proof. The above properties can be proven as follows: 


1. m|a—a=0 for all m. 
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10. 


. Asm|a—b,a—b=km. Then b—a= (—k)m, sob=a mod m. 
. Asm|a—b,b—c, we have m|(a—b)+(b-—c)=a-—c. Hencec=a mod m. 


. Asm |a;— 6; for 1 <i <n, we have m | (a; — b1) + (a2 — bg) +--+ + (an — bn). Hence 


a, t+ dg+-+- +a, =b) +bo+---+b, mod m. 


. Asm | (a+b) —c, we have m|a—(c—b). Hencea=c—b modm. 


. Asm |a—b and m|c-—c, we have m| (a—6b)+ (c—c) = (a+c) — (b+ c). Hence 


atc=b+e mod ™m. 


. As m | a; — 6;, we have a; — 6; = tim for integers t; and 1 <i <n. Hence ajag:-- ay) = 


(by) +t1m)(bo+tom) ---(b,+t,m). Expanding the left side gives the form b,b2---b,+tm 
for some integer t. Hence a,a9--- da, = bjb2---b, mod m. 


. Ifm|a— 6, then m | c(a — b) = (ca) — (cb). Hence ca = cb mod m. 


. Set a; =a and b; = 6 for 1 <i <n and use result 7. 


Set f(x) = coteyra+:+-+cenx2". Then f(a)—f(b) = cy(a—b)+c2(a?—b?)+- + -+e,(a"—b"). 
All of these terms are divisible by a — b, hence a— 6 | f(a) — f(b). Asm |a—b, we 
have thus m | f(a) — f(b). Hence f(a) = f(b) mod m as desired. 


Proposition 2.1.2: 1. Ifa=b mod m, then gcd(a,m) = ged (b,m). 


2 


3. 


4. 


a=b mod m if and only if a and b have the same remainder upon division by m. 
a=b mod m, for 1<i< nif and only ifa@=b mod lem (m1, mz,--- ,mMn). 


If ka = kb mod m, then a =b mod (ap) In particular, if ged(m,k) = 1, then 
a=b modm. 


Proof. We show the desired results as follows: 


ip 


As a = b mod™m, we have m | a — 6, and thus a — b = tm for some integer t. 
Let d,; = ged(a,m),dz = gcd(b,m). Then d; | a,m, so d; | a— tm = b. Hence 
d, | gcd(b,m) = dy. We may similarly show dz | d;. As d,,d2 > 0, we have thus 
dy — dy. 


. Let a = mqtri, b = mqgt+r2, where 0 < r1,r2 < m. Then a—b = m(qi—qe) + (11-12). 


We have a = b mod m iff m | a — 6, which is in turn equivalent to m | m(qi — q2) + 
(r1) — rg). This is equivalent to m | ry — rg. As |ry — re| < m, we have m | r1 — ro iff 
ry — rg =0, or r; = 12. This achieves the desired result. 
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3. If m; | a—6 for 1 <i <n, then a — 6 is a common multiple of the m; and hence is 
divisible by lem (m1, mz,--+ ,mM,). On the other hand, if lem (m1, mz,--- ,mMn) | a—b, 
then m,; | a — b for all i. This proves our result. 


4. Let d= gced(m,k). Write k = dki,m = dm,; then ged (mj, k,) = 1. As k(a — 6)/m = 
ki(a — b)/m , is an integer, and as gcd (mj ,k,) = 1, we must have m, | a—b. As 


ue we achieve the desired result. 


me gcd (m,k)? 


Problem 2.1.3: Verify the following congruences: 
1,09. 3° > 0 cmod. 13: 
D570 = 3 mod: 10: 
3. (20719 — 41)'° = 24 mod 100; 
4. 27> =—1 mod 641. 


Proof. 1. We have 2° = —1 mod 13. Hence 27 = 2*.- (2°)!1 = —24 = 10 mod 13. 
We have 3? = 1 mod 13. Hence 3” = 3 - (3°) = 3-13 = 3 mod 13. Hence 
27 + 3 =10+3=0 mod 13. 


2. We have 34 = 81 =1 mod 10. Hence 320% = 3 - (34)° = 3-1°°? = 3 mod 10. 


3. We have 7* = 2401 =1 mod 100. Hence 207!9 = 7! = 73 - (74)4 = 73 - 14 = 343 = 43 
mod 100. Hence 207!9 — 41 = 2 mod 100. But then (207'9 — 41)! = 21° = 1024 = 24 
mod 100. 


4. We have 641 = 5-274 1 = 54+ 24. Hence 5-2’ = —1 mod 641 and 54 = —(2*) 
mod 641. Then we have 2?) = 2°? = 24. (2")4 = —(5*)(2")4 = —(5-27)* = (-1)? =-1 
mod 641. 


Remark 2.1.4: If we define the Fermat numbers as in the lecture on greatest common 
divisors and least common multiples, it may be verified that Fo, Fi,--- ,F, are prime. The 
above result shows that Fs is not prime; it has been computed that none of Fs through Fp 
are prime. It is still an open question as to whether there are any k > 4 for which F, is 
prime. 


Problem 2.1.5: Find the last digit of: 
1. 22312 — 4415. 


2. g 1003 = 7902 ile 3801 
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Proof. 1. We have 223! = 3” = (3*)4=12=1 mod 10. Similarly, 441° = 4° = (4°33 = 
4° =4 mod 10. Hence 223! — 44° =1—4=7 mod 10, so its last digit is 7. 


2. We have 91° = (—1)1°3 = —1=9 mod 10. In addition, 79°? = 494°! = (—1)*! = — 
mod 10. Finally, 39°! = 3 - (34)? = 3-12° =3 mod 10. Hence 9103 — 79? + 3801 = 
(—1) —(—1) + 3=83 mod 10, so the last digit is 3. 


Problem 2.1.6: Prove that for integers x, y and prime p, we have (1+y)? = x+y? mod p. 


Proof. The binomial theorem gives (x+y)? = (P)a?y°+(P)aPotyl +. + + (Pee +E. 
As proved in the previous day’s lecture, p | (?) for 1 < k < p—1. Hence p | (x+y)?—(x?+y?), 
so (x+y)? =a? +y? mod p. 


Problem 2.1.7: Show that if pis a prime and 0 < k < p—1 isan integer, then Gas) =(-1)* 
mod p. 


Proof. The case k = 0 is trivial. If k > 1, we have p—1=-—1 mod p, p—2=-—2 mod p, 
and so on till p— k = —k mod p. Hence eae = (p—1)(p — 2)---(p —k) = (—1)*k! 
mod p. As ged (p, k!) = 1, we have thus (”,") = (—1)* mod p. 


Residue Classes 

Given m a positive integer, we say that two integers a and b belong to the same residue 
class modulo m if a= 6b mod m - that is, if they have equal remainder upon division by m. 
Congruence modulo m divides the set of integers Z into m disjoint residue classes, commonly 
denoted by a+ mZ for a = 0,1,--- ,m— 1 and defined asa+mZ= {a+ mk:k € Z}. 

A set S of integers is called a complete set of residue classes modulo m if for each 
0<i<m-—1 there is some s € S such that s = 2 mod ™m. It is obvious that any set S' of 
m consecutive integers is a complete set of residue classes modulo n for all 1 <n <m. 


Problem 2.1.8: Prove that: 
1. n?=0,1 mod 3; 
2. n*=0,+1 mod 5; 
3. n? =0,1,2,4 mod 7; 
4. n?=0,+1 mod 9; 
5. n*=0,1 mod 16. 
Proof. 1. For alln, n =0,+1 mod 3. Hence n? =0,1 mod 3. 
2. For all n,n =0,+1,4+2 mod 5. Hence n? =0,1,4=0,1,-1 mod 5. 


3. For all n, n = 0,+1,+2,+3 mod 7. Hence n? =0,1,4,2 mod 7. 
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4. Bor all-a := Sk, 3k b 1. i= 3h, then(3h)? = 27k? = 0 mod 9. Than = 3k + 1, 
then n? = 27k? + 27k? + 9K 41=+1 mod 9. Hence for all n, n? =0,+1 mod 9. 


5. If n = 2k, then n* = 16k* = 0 mod 16. If n = 2k +1, then n? = 14 4k + 4k? = 
14+ 4k(k +1). Since k(k + 1) is even for all k, we may write k(k + 1) = 2s. Hence 
n?=1+4+8s. Thus n* = 64s? + 16s+1=1 mod 16. 


Problem 2.1.9: Prove that if p | x? + y?, where p = 3 or p=7, then p| x and p | y. 


Proof. We first deal with the case p = 3. If 3| x, then 3 | y, and vice versa. Suppose that 
3|a2%+y? and 3f2,y. Then x?+y?=0 mod 8, and z?,y2 #0 mod 3. Hence x7 = y? = 

mod 3, so x? + y? =2 mod 3. Contradiction; hence 3 | x,y. Now we take the case p = 7. 
If 7 | x, then 7 | y, and vice versa. If 7 | 2? + y?, but 7{ x,y, then we have x? + y? = 0 
mod 7 and x7, y? = 1,2,4 mod 7. We may check that no two of {1,2,4} add to 0 modulo 
7. Contradiction; hence 7 | x, y. 


Problem 2.1.10: Let a and m be positive integers. Then S = {1-a,2-a,---,m-a}isa 
complete set of residue classes modulo m iff gcd (a,m) = 1. 


Proof. Suppose that gcd(a,m) = 1. If S is not a complete set of residue classes modulo 
m, then we have ia = ja mod ™m for i #4 j. Hence m | a(i— 7). As ged(a,m) = 1, we 
have m |i-—j. But asi > 1 and j < m, we have —(m—1) <i-—j < m-—1. Hence 
i—j = 0, so 2 = Jj. Contradiction; hence S is a complete set of residue classes modulo 
m. Now assume that S is a complete set of residue classes modulo m, and suppose that 
d = gcd(a,m) > 1. Set a = day,m = dm, where ged(a1,m ) = 1 and m, < m. Then 
we have ma = mj a,d = a,(md) = aym = ma = 0 mod™m. Hence S$ cannot contain m 
distinct elements modulo m, so S cannot be a complete set of residue classes modulo m. 
Contradiction; hence gcd (a,m) = 1. 


Problem 2.1.11: For any positive integer m, any integer a with gcd (a,m) = 1, and any 
integer b, there is some integer x with ax = b mod m. The set of all such x form a residue 
class modulo m. 


Proof. By the previous result, the set S = {a-1,a-2,--- ,a-m} is a complete set of residue 
classes modulo m. Hence there is exactly one element 7; € S with a- 7; =b mod m. Now 
we must only show that the solution set to this congruence is a residue class modulo m. 

If we have some x2 € Z with azz = b mod m, then we have ax; = ax2 mod m. Hence 
as gcd(a,m) = 1, we have x; = 2 modm. Thus 21,2 are in the same residue class 
modulo m. Conversely, if x; and x are in the same residue class modulo m, then we have 
1 =xX_ mod m. Hence b= ax, =axryg mod m, so axg = b mod m. It follows that the set 
of solutions to ax = b mod m forms a residue class modulo m. 


Problem 2.1.12: Find all solutions to the congruence: 
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1. 2x =3 mod 5; 
2. 3x =1 mod 10; 
3. 15a =5 mod 20. 
Proof. We give the following solutions: 


1. As x = 4 satisfies the congruence, and as gcd (2,5) = 1, the solution set is the residue 
class 4 + 5Z. 


2. As x = 7 satisfies the congruence, and as gcd (3, 10) = 1, the solution set is the residue 
class 7 + 10Z. 


3. As ged (15,20) # 1, we must reduce the congruence to a different modulus. We may 


reduce the congruence to 3x = 1 mod 4, as 4 = coe Then x = 3 satisfies this 


congruence; hence the solution set is the residue class 3 + 4Z. 
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EL; 
t. 


Problems 
Prove the congruences: 
(a) 27° + 37°=2 mod 11; 
(b) 13°? =1 mod 7; 
(c) (21303 — 1336)? =4 mod 11: 
(d) 2!431=2 mod 11-31 


. Determine the last two digits of: 


Gre, 
(b) 22919 + 3720, 


. Determine all natural numbers n such that: 


(a) 5 | 2" +3"; 
(b) 7| 3" —2. 


. Prove that the sequence a, = 2” — 3 for n > 0 has infinitely many terms divisible by 


5 and infinitely many terms divisible by 13 but no terms divisible by 5 - 13. 


. Determine all integers x, y, z with: 


(a) x2 + y? = 32008, 


(b) at + yt + 24 = 2100, 


. Let py < po < +++ < pg: be prime numbers such that 30 evenly divides p}+p5+---+:p3,. 


Determine pj, po, and ps3. 


. Determine all solutions of the congruence: 


(a) 54 +2=0 mod 11; 
(b) 102+ 25=0 mod 215; 


. Determine all primes p and q such that p+q = (p—q)’. 


. Let a be an odd integer. Prove that a?” + 2?” and a?” + 2?" are relatively prime for 


all distinct positive integers n and m. 


. Determine all positive integers for which n! + 5 is a perfect cube. 


Prove that if a =b modn then a” = b" mod n?. Is the converse true? 


Determine all n such that 1! + 2!+---+ n! is a perfect power. 


Chinese remainder theorem 


Lt 
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Chapter 3 


Arithmetical Functions 


arith-f 


1 Arithmetical functions 


Definition 3.1.1: An arithmetical function is a function f defined on N. 


axith-fmult f (7071) = f(m)f(n) (3.1) 
for every m and n relatively prime, then f is multiplicative. 


2. If (3.1) holds for every m,n € N, then f is completely multiplicative. 


Note that if n = p{'---p%™ is the prime factorization of n, then 


f(n) { F (pt) ---f(p%) if f is multiplicative, 
n)= 
Vito) +++ f(Dm)%™ if f is completely multiplicative. 
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Number of divisors 
Totient 


Sum of divisors 


2 

3 

A 

5 Mobius function 
6 Sums of digits 
7 


Finite calculus 


Theorem 3.7.1 (Summation by parts, Abel summation): sumparrs Suppose that wu is an 
arithmetic function, and let 


Then for m,n € N 


» u(x)u(x) = U(n)v(n) — U(m — 1)u(m — 1) - os U(a — 1)(v(x) — v(@ — 1)). 


If 0 <a <b and v has continuous derivative on a < x < b, then 


ax<a<b 


Proof. We imitate the proof of integration by parts. For a function f define the function 
A_(f) = f(x) — fle — 1). 


This is the discrete analogue of differentiation. It is the inverse of summation in the sense 
that by telescoping, 


S A-(/) = Fn) = fm =) (3.2) 
Note that A_(U) = u. We have the “product rule” 
A_(uv) = u(xz)u(x2) — u(x — 1)u(a — 1) 


= (u(x) — u(a — 1))o(@) + u(@ — 1)(o(@) — o(@ — 1) 

= A_(u)v + E_uA_(v) 
where F_ is the left shift operator (E_f)(x) = f(a —1). Replacing u by U and rearranging 
gives 


uv = A_(Uv) — E_UA_(v). 


14 


Number Theory, §3.7 


Summing over m < x < n and telescoping using (3.2) gives 
nm 


3 u(z)v(x) = U(n)u(n) — U(m — 1)o(m — 1) — SS OU (@ - 1)(e(2) — v(@ - 1)). 


When v has continuous derivative, noting U(t) = U(|t|), we have 


Y Ue=N@) -oe-I) = ¥ f" vOwa 
" U 


c=mM 


= (t)u'(t) dt. 


m—1 
For general a,b, since U is constant on (|b| ,b) and (a, |a| + 1), 
[b| 


d ulx)o(z)= DP ulx)o(a) 


a<a<b x=|a|+1 
= U((b))v([b]) —U(La})u(Lal) + i U(t)u!(t) at 


= U(b)v(b) — Ula)o(a) +f U(eo'(e at 


Interesting: (Putnam ??) Suppose that a is a real number such that all numbers 
17, 27, 3°,... are integers. Prove that a is also an integer. 
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Chapter 4 


Multiplication modulo n 


1 Introduction 


In Chapter |2) you learned how modular arithmetic works: you fix a number n, and then 
compute by taking all integers modulo n, that is, you throw away all information except the 
remainder when dividing by n. This system obeys all the “rules” we expect it to, and is 
especially well-behaved when n is a prime. 

There is a lot of similarity between addition and multiplication modulo n, but it seems 
multiplication is less “transparent.” Compare the following. 


1. We saw how we could use modular arithmetic to solve problems such as: what is the 
last digit of 7 - 2011? What is the last digit of 77°"? 


You probably see immediately the first answer is 7. To solve the second problem we 
looked for a repeating pattern in the powers of 7 modulo 10: 1, 7, 9, 3, ... We have 
7* = 1 (mod 10), so the “cycle” repeats after every 4 times, and 7771 = 7? = 3 
(mod 10). 


Making an analogy, note that if look at the sequence 7,2-7,3-7,... we get back to 0 
after 10 steps, while in the sequence 7,77, 7° we get back to 1 after 4 steps. It’s easy to 
understand how the cycle repeats when we add repeatedly (i.e., multiply), but what 
about when we multiply repeatedly (i.e., take powers)? 


In general, given a (mod n), how many times do we have to multiply a by itself to get 
back to 1 (what is the “order” of a modulo n)? How does this depend on a and n? Is 
there some general rule that determines the order? 


2. We have the notion of inverses. For instance, what is the additive inverse of 7 modulo 
19, i.ec., what do we add to 7 to get 0 modulo 19? It is simply —7 = 12 (mod 19), 
because 

7+12=0 (mod 19). 


What is the multiplicative inverse of 7 modulo 19, i.e., what do we multiply 7 by to 
get 1 modulo 19? You might have to think a bit more on this one—it is 11 (mod 19), 
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because 
7-11=77=1 (mod 19). 


In this chapter you will learn to completely determine the multiplicative structure of 
the integers modulo n. In Section |2} we will make formalize the notion of the order of an 
element: how many times we have to multiply a number to get back to 1 modulo n. We 
will develop a toolbox of basic theorems: Euler’s Theorem and Fermat’s Little Theorem in 
Section |3} and Wilson’s Theorem in Section Euler’s Theorem will give us m such that 
a™ = 1 (mod n). 

We will use these theorems to solve a variety of problems in Section For instance 
(keepin in the theme of repeating patterns) we will explain why the period of the repeating 
decimals i and a are 16 and 18: 


1 
7 0588235294117647 
16 


“4 = .052631578947368421 . 
18 


At this point we find that a lot of the stuff we’ve been doing works in a much more abstract 
setting, that of groups. In Section [5] we recast all the theorems in the language of group 
theory, and in this way give a way think about the similarities behind addition and multi- 
plication modulo n. With the language of group theory, we give a complete characterization 
of the multiplicative structure of Z/nZ in Sections {6] and 

Finally we give two applications, to primality testing (Section ??) and basic cryptography 
(Section ??). 


2 Order of an element 


sec:order 
We’ve seen that the values taken by a powers of an element modulo n form a repeating 
pattern, and under certain relative primality conditions, start each cycle at 1. For example, 


3°=1 (mod 5) 
3'=3 (mod 5) 
3°=4 (mod 5) 
3°=2 (mod 5) 
3*=1 (mod 5) 
3°=3 (mod 5), 


so the powers of 3 cycle 1, 3, 4, 2, ... modulo 5. In particular, we get back to 1 in 4 steps: 
3* = 1 (mod 5). Hence we call 4 the order of 3: 
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Definition 4.2.1: Let n > 1 and let a be an integer relatively prime to a. The order of a 
modulo n is the smallest positive integer m such that a” = 1 (mod n). In symbols, 


ord, (a) =min{m Ee N:a"=1 (mod n)}. 


For example, 
ords(3) = 4. 


Note that we place the modulus as an 
Note that the order is well-defined for all a relatively prime to n: Indeed, there are only a 
finite number of residues modulo n, so two powers of a must be equal modulo n. So suppose 


0<m, < ms, and 


my m2 


a”! =a (mod n). 


Since a is relatively prime to n, we can take inverses to find a’2~™! = 1 (mod n). 

Our first result is that the set of all positive integers k for which a* = 1 (mod m) is 
completely determined by its smallest element, i.e. the order. In the case above, the set of 
all m such that 3” = 1 (mod 5) is exactly the set of multiples of 4. 


Proposition 4.2.2: Let n > 1 anda Ln. 


1. The set of m such that a” = 1 (mod n) is exactly the set of multiples of ord,(a). In 
other words, 
a” =1 (modn) <> ord,(a) | m. 


2. The numbers 


1,a,...,a%n(@)-1 


are all distinct, and every power of a is congruent to one of these. 
Proof. Let d = ord,(a). 
1. The reverse direction is easy: If d|m, then write m = dk. We have 


a’™ = (a7)*=1*=1 (mod n). 


Conversely, suppose that a” = 1 (mod n). We use the same technique as [gcd’], 
noting that we picked ord,,(a) to be the least positive integer with this property. Using 
division with remainder, write 


m=dk+r,0<r<m. 


We have 


a =a" * =a" “=o"=1 (mod n). 


Since d is the least positive integer for which a4 = 1 (mod n), and 0 <r < d, we must 
have r = 0. Hence d | m.' 


‘For another way to phrase this proof, see Problem|s}i} 
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2. For the second part, writing m = dk +r as above we note that 
area Sa’ Gnod'n): 


If0 <r, < rg < ord,(a), then 0 < rg —1r1 < ord,(a) implies a”~"! 4 1 (mod n) and 
hence a™ 4 a™ (mod n). 


Now we have an abstract description of the numbers m such that a” = 1 (mod n). We 
know there is some positive integer with this property, and that all others are multiples of 
that number. But we would like something more concrete: is there some m depending on n, 
so that we will always have a” = 1 (mod n)? The next section will answer that question. 


3 Euler’s theorem and Fermat’s little theorem 


sec:euler-thm 


Theorem 4.3.1 (Euler’s theorem): cuicr-ineorem Let n > 1 be an integer. For any integer a 
relatively prime to n, ord,(a) | y(n) and 


a’ =1 (mod n). 


Corollary 4.3.2 (Fermat’s little theorem): «: Let p be a prime. For any integer a, 


If a #0 (mod p), then 


Let G be the set of invertible residues modulo n. We present two proofs. 


Proof 1. Let m, denote the function G — G defined by 


Ma(g) = ag. 
Note that this is an invertible function as its inverse is 


m, () =a 9g. 


Hence it is a bijection G— G. This means that the elements ag, g € G are an reordering of 


the elements of G. Hence 
Il «g= IL 9 (mod n). 
geG gEG 


Dividing both sides by [],eg g and noting |G| = y(n) gives 


a’ =1 (mod n). 
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Proof 2. The main idea is that there are y(n) possible invertible residues modulo n, and 
so the number of elements in the set H := {a” mod n:m € N} must be a divisor of y(n). 
To show this we show that “translates” of this set cover all y(n) nonzero residues without 
overlap. The fact that H has nice multiplicative structure will be essential. 

First note the following facts. 


1. 1 EH. (This is because 1 = a®.) 
2. fhe H then he H. (If h=a™ (mod n) then h7! = a-™ (mod n).) 
3. If hy, ho € H then hyhy € H. (If h; =a™ (mod n) then hyhg = a™*™ (mod n).) 


Given two nonzero residues b,c modulo p, we write b ~ c if b € H. We claim that ~ is an 
equivalence relation. We check the following. 


1. b~ b: This holds by item 1 above, since =I, 


2. Ifb~c then c~ b: This holds by item 2 above, since b =f: 


3. Ifb~candc~dthen b~d: This holds by item 3 above since 2 = : oe 
Thus G is split into equivalence classes. If C’ is an equivalence class and c is any element in 
C’, then we have 
d 
C={did~e}= {di eH} =(chihe H}, 
@ 


Since multiplication by c is invertible, C' has |H| elements. (It is the RHS that suggests the 
sets C' are “translates” of H.) 
Thus, letting [G : H] denote the number of equivalence classes, we have 


IG| = |G: HHI. 


Hence |H| divides |G| = y(n). But by Proposition 2), |H| = ord,(a). Since ord,(a) | 
y(n), by Proposition 1), we get 


a’ =1 (mod n). 


Although the first proof is shorter, the first reveals hints at some important ideas with 
broad generalizations, which we will discuss in Section [5 


Proof of Corollary Since (pp) = p—1 and the invertible residues modulo p are exactly 
the nonzero residues, we get 

a’-+=1 (mod p) 
for a #0 (mod p). Multiplying by a gives the first statement for a # 0 (mod p). If a = 0 
(mod p) the first statement obviously holds. 


Remark 4.3.3: The converse of Fermat’s little theorem is not true: if a? = a (mod p) for 
all a, then p is not necessarily prime. For example, 2'3! = 2 mod 11-31, but 11-31 is not 
a prime. Indeed, there are certain numbers n such that for all integers a, we have a” = a 
mod n with n not a prime. Such numbers are called Carmichael numbers, and the first few 
are given by n = 561, 1105, 1729, 2465. 
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4 Examples 


sec:mod-ex 


4.1 Using Euler’s theorem 


Without further ado, we give some applications of Fermat’s little theorem and Euler’s theo- 
rem. The first, most popular application is in finding large powers modulo a certain number. 
While before, we had to evaluate a, a’, a?,... until we got back to 1, our work is now shorter. 


Example 4.4.1: Find 3!%° (mod 2012). 
The prime factorization of 2012 is 27-503, so y(2012) = 2-502 = 1004. As 3 is relatively 
prime to 2012, by Euler’s Theorem 


31006 = 3? =9 (mod 2012). 


“Find big power modulo n problem” 
“Tower of exponents problem” 
Remark about “thinking backwards” 


Example 4.4.2: Show that for all primes p > 7, the number 11---1 is divisible by p. 
—S_Y 
p-l 
Solution. The key to this problem is writing an algebraic expression for 11---1. By the 
—S—’ 


p-1 
geometric series formula, 


10?-+-1 


11--To1+10+-+10 7 = 
—_—-— 9 


p-1 
Because p { 10, by Fermat’s little theorem we have 
10° *=1 mod p = p| 10? *—1. 


Because gcd (9, p) = 1, we have (10?-! — 1)/9=0 mod p as desired. 


4.2 Computing the order 


The following proposition gives practical ways to compute the order of an element. 


Proposition 4.4.3: Let n > 1, let a be an integer relatively prime to n, and set d = ord,,(a). 


1. (Power of the base) 
d 


ky _ 
ord, (a") = PoC HA 
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2. (Multiplying the base) Let d = ord,,(a),c = ord,(b). If ged (d,c) = 1, then ord,,(ab) = 


de. 


3. (Multiplying the modulus) Let the prime factorization of n be n = pftpS?---pp*. Let 


d; = ord, (a). Then 
d= lem(d;, do, ead a). 


Warning: It is not necessarily true that ord,,(ab) = lem(ord,(a), ord,,(b)). 


Proof. 1. Set m = gcd(d,k). Write d = md,,k = mk,, where gcd (dj, k,) = 1. 
t = ord,,(a*). Then we have 


(Gr \et = ght a" P41 “od. 


Set 


Hence t < d,. On the other hand, a = (a*)'=1 mod n. Then we have d | kt, hence 
d, | kit. As d,,k, are relatively prime, we have d, | t, hence d, < t. It follows that 
t = d, as desired. 


. Set e = ord,(ab). Then we have (ab)° = 1 modn. Hence (a%)(b%) = a@(b°)* = 
a~ =1 modn. Hence d | ce. As ged (d,c) = 1, we have d| e. Analogously, we have 
(a%)(b%) = (a*)*b® = b* = 1 mod n. Hence c | de, soc |e. As gcd(d,c) = 1, we 
have dc | e. However, we have (ab) = (a7)°(b°)® =1-1=1 mod n. Hence de = e as 
desired. 


. Set e = ord,(ab). Then we have (ab)° = 1 mod n. Hence (a“)(b~) = a@(b°)* = 
a =1 modn. Hence d | ce. As ged (d,c) = 1, we have d| e. Analogously, we have 
(a%)(b%) = (a%)*b® = b* = 1 mod n. Hence c | de, soc |e. As gcd(d,c) = 1, we 


have dc | e. However, we have (ab) = (a*)°(b°)° =1-1=1 mod n. Hence de =e as 


desired. 


Problem 4.4.4: Let a > 1 and n be positive integers. Show that n divides y(a” — 1). 


Proof. We have that the order of a modulo a” — 1 is n. But we have ordgn_i(a) | y(a” — 1), 


hence n | y(a” — 1) as desired. 


Note that trying to use the formula for y(m) in terms of the prime factorization of m 


doesn’t work for this problem. 


Problem 4.4.5: Determine all positive integers n such that n divides 2” — 1. 


Proof. We shall show that n = 1 is the only solution. Suppose that n | 2" — 1 for n > 1. 
Then n must be odd. Let p be the least prime divisor of n; then 2” = 1 mod p. Write 
d = ord,(2). Then d > 1 and d|n. Hence p < d since p is the least prime divisor (and hence 
least divisor greater than 1) of n. But by Fermat’s little theorem, 2?-' = 1 mod p. Hence 


d|p—1- that is, d<p-—1. Hence p < p—1. Contradiction; hence no such n exist. 
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Problem 4.4.6: Let a be a positive integer, and let p,q > 2 be primes with a? = 1 mod gq. 
Prove that either g | a—1 or g=14+4 2kp for some positive integer k. 


Proof. Obviously we have gcd (a,q) = 1. Write d = ord,(a). Then we have d | p, sod =1 
ord=p. Ifd=1, theona=1 modgq,soq|a-—1. If d=p, then we have p| y(q) = q-1, 
so g = 1+ np for some integer n. But as q > 2 is a prime, g must be odd. As p is odd, we 
must have n even for g to be odd. Writing n = 2k, we find g = 1 + 2kp as desired. 


Problem 4.4.7: Let p,q be primes with a?~!+a?-?+---+a+1=0 mod q. Prove that 
either g = p or g=1 mod p. 


Proof. If p = 2 then either g = 2 = p or q is odd and g=1 mod p. The case p > 2, q = 2 
is impossible since the left-hand expression is odd. Now we have the case where p and q 


are odd. Then we have a? —1 = (a—1)(a?"!+ a?-?+---+a+4+1), hence a? = 1 mod gq. 
Thus either g = 1 modporgq|a-1. If q|a-—1, then we have a = 1 mod gq, hence 
p14 4p? 4....414+1=p=0 mod gq, from which it follows that p = q. 


Problem 4.4.8: Let n be an odd positive integer. Prove that if n | 3°+1 then n = 1. 


Proof. Obviously n is not divisible by 3. Suppose that n > 1 and let p be the least prime 
divisor of n; then p > 5. Write d = ord,(3). As 3” = —1 mod n, we have 3°" =1 mod p, 
so d | 2n. As 3?-!'=1 mod p, we have also d | p—1. If d is odd, then we have d| n. As 
p is the least divisor of n greater than 1, we must have thus d = 1. Hence 3 = 1 mod p, 
implying p = 2. But p > 5; contradiction. Hence d must be even. Write d = 2k; then k | n, 
and if k > 1 then we have 1 < k < d < p, contradicting the fact that p is the minimal 
divisor of n greater than 1. Hence d = 2 and 3? =1 mod p, so p = 2. Contradiction; hence 
n=l. 


Problem 4.4.9: Let gcd (a,b) = 1 with b odd. Show that ged (n* +1,n° — 1) < 2 for any 
natural number n. 


Proof. Write | = gcd (n? + 1,n° — 1), and suppose that | > 1. Write d = ord;(n). Then we 
have n? =1 mod I, so d |b. Hence d is odd. But then as n*? = —1 mod I, we have d | 2a. 
Hence d|a. If d > 1 then we have d | a,b, so gcd(a,b) > 1. Contradiction; hence d = 1. 
Thus n*=1 mod l, hence 1=1 mod Il. Hence / = 1 or / = 2 as desired. 


Problem 4.4.10 (IMO 1990/3): Determine all positive integers n such that n? divides 
he el 


Proof. Clearly n = 1 is a solution. Suppose that n > 1; then n is odd. Let p be the least 
prime divisor of n, and write d = ord,(2). As 2?" =1 mod p we have d| 2n. As 2?-' = 1 
mod p we have d | p—1. If d > 2, then let q be a prime greater than 2 dividing d. Then 
q | 2n and q | p—1, contradicting the fact that p is the minimal prime dividing n. But we 
have d > 1, hence d= 2 so p= 3. 
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Write n = 3°m, with s,m > 1 and 34m. Suppose that s > 1. Then we have 
32s | a a 1) Gees = 93° (m—2) cine 93° ao 1) ; 


But since 2°” = —1 -mod 3, we have 2° ¢"-) —93(m-2) 4.9] ST lt tl Sms 0 
mod 3. Hence 378 | 23° +1. 
We claim that for all s, we have 3°+? { 2° +1. We may write 


s s e s e s 2 3° 
2 4+1= (3 — (7) why (3) Fe (,.” 9)+ (50 1)8-1) +1. 


But then we have 3°*? divides all terms in this expansion except (,.° ,)3!; hence 3°? { 2? +1. 

As we have 378 | 2° +1, we have thus 2s < s +2. Hence s = 1, son = 3m. Suppose that 
m > 1. Let q be the least prime divisor of m; then g > 5. Write e = ord,(2); then as we 
have 2?" = 1 mod q, we have e | 2n = 6m. As 271 =1 mod q, we have also that / | q— 1. 
Hence we cannot have | | n, as this would contradict the fact that q is the smallest prime 
divisor of n. Thus as gq > 5 we have / = 3 or 1 = 6, meaning that g = 7. But in this case 
we have 7 | 2” + 1; however, as n = 3m, we have 2” + 1 = (23)"+1=1"+1=2 mod 7. 
Contradiction; hence m = 1, son = 3. Thus n = 1,3 are our only solutions. 


Example 4.4.11: Let p,q,r be distinct primes such that 
pq |r? + ri. 
Prove that either p or q equals 2. 


Solution Suppose the relation holds but p 4 2,q 4 2. By Fermat’s Little Theorem, 
r? =r (mod p) and r?=r (mod q). Then since r is relatively prime to p,q, 


r?+r7=0 (mod p) => 
rt'=-1 (mod p) 
r?+r7=0 (mod gq) => 


r?-+=-1 (mod gq) 


Since —1 #1 (mod p,q), we get 


ord,(r) { gq —1,ord,(r) { p—1. (4.1) 
Since 
72-1) = 1 (mod p) 
ee Kane g); 
we get 


ord,(r) | 2(q— 1), ord,(r) | 2(p — 1). (4.2) 
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For an integer n let vo(n) denote the highest power of 2 dividing n. Let x = vp(ord,(r)) and 


y = v2(ord,(r)). From relations in (4.1) and (4.2), 


y = vo(p—1) +1. (4.3) 


y < v2(q — 1) (4.4) 


Putting (4.3) and (4.4) together, we get x < y—1,y < x—1, contradiction. 


5 Groups 


For the moment, it is helpful to “forget” where our set comes from and just work from the 
basic axioms that it satisfies. 


Definition 4.5.1: A group is a set G together with a binary operation 0, satisfying the 
following properties: 


1. (Associative law) For any a,b,c € G, 


(ob) oc= Go (box), 


2. (Identity) There exists an element id, called the identity, such that for all a, 


idoa=aocid=a. 


3. (Inverses) For any a there exists an element a’, called the inverse of a, such that 


aod =a’ oa=id. 


G is called an abelian group if additionally it satisfies the following. 
4, (Commutativity) For all a,b € G, aob=boa. 


We will be dealing exclusively with abelian groups. 
Define order, exponent. Largest order IS the exponent (for abelian groups) 
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6 Primitive roots 


We now know that a?) = 1 (mod n) for all a relatively prime to n, and that ord,(a) | y(n). 
We can ask, does there exist a for which ord,,(a) is exactly n? 

Equivalently, by the second part of Proposition since there are y(n) possible in- 
vertible residues modulo n, this says that the powers of a achieve every possible invertible 
residue modulo n. 


Definition 4.6.1: A primitive root modulo n is an integer a such that 
ord, (a) =@(n). 
For example, 3 is a primitive root modulo 5, as ords(3) = 4. 


Theorem 4.6.2: Primitive roots exist modulo n if and only if n = 2, 4, p*, or 2p* for p an 
odd prime. 

Moreover, if g is a primitive root modulo p?, then it is a primitive root modulo p* and 
2p* for any k. 


Proof. We will prove the “if’ part of the theorem. The “only if’ part will fall out from 
Theorem in the next section. 
For n = 2 or 4, we see that 1 and 3 are primitive roots, respectively. 


Part 1: Now suppose n = p is prime. We note that by Fermat’s little theorem that 
z’-'—1=0 (mod p) 


for all nonzero residues x modulo p. 

Note that if there is are elements of order d,,...,d, then there is an element of order 
lem(d,,...,d,) (Proposition ??). Hence if d is the maximal order of an element in (Z/nZ)*, 
then all orders must divide d. Hence 


z*?—1=0 (mod p). 
Now we need the following lemma. 
Lemma 4.6.3: A nonzero polynomial f(X) € Z/pZ|X]| of degree d has at most d roots. 


Proof. We induct on the degree. If d = 0 the assertion is clear. If f(X) has a root, then 


= 
is 
ql 


(X —a)g(X) (mod p) 


for some g(X) € Z/pZ|X] of degree d— 1. Now f(X) =0 (mod p) implies that one of the 
factors X — a or g(X) is 0 modulo p: this is because there are no zerodivisors modulo p. 
Hence the roots are a and the roots of g(X); the latter total at most d—1 by the induction 
hypothesis. 
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Now x? — 1 = 0 can have at most d roots modulo p, but we know all p — 1 invertible 
residues are roots. Hence d > p— 1. But we know that the order of any element divides 
p—1,sod|p—1 and we get d=p-—1. 


Part 2: Now we prove the theorem for p*. 
We first show that there is a primitive root modulo p’. Take a primitive root x modulo 
p; suppose it is not primitive modulo p?. Now 


p—1=ord,(2) | ord,a(x) | p(p — 1) 
where the right-hand divisibility is strict. Hence ord,(x) = p — 1. Now note 
ord,2(p + 1) = p, 


since (1+ p)* =1+kp (mod p?) and this is 1 modulo p? for the first time when k = p. By 
Proposition ??(2), 

ordy2(2(p + 1)) = p(p — 1) = (p") 
so z(p + 1) is a primitive root modulo p’. 

Now suppose x € Z is a primitive root modulo p”. It attains every residue modulo p? so a 
fortiori it attains every residue modulo p, i.e. is primitive modulo p. We show by induction 
that 

oP OD — phi 4] (4.5) 
for some 7 not a multiple of p. For the case k = 1, this is since x is a primitive root modulo 
p, but x?-' #1 (mod p?). Suppose it proved for k; then 


k(n ; Pp 2 Pp . : ‘ 
PO) = (phj +1)? = 14 (Pots + (2) += 1+p°G +7’) 


for some j’. This shows the claim for k + 1. Since 
p—1=ord,(z) | ord,«(z) | p*"!(p — 1) 
we know ord, (a) must be in the form p’~!(p—1) for some j. Equation (4.5) shows that j = k. 


Part 3: Note that y(2p*) = y(p*). Thus any primitive root modulo p* is automatically a 
primitive root modulo 2p*. 


Remark 4.6.4: Note the existence of a primitive root modulo n is equivalent to the fact 
that (Z/nZ)* is generated by one element, i.e. is the cyclic group Cyn). Hence if there are 
primitive roots modulo p* for all k, then the quotient maps 


aoe (Z/p®Z)* a (Z,/p?Z)* > (Z/pZ)* 


correspond to maps 
ind Cy? (p-1) —» Cp(p—1) > Cp-1. 
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Let g, be a generator for C,x-1(p_1); the kernel of the map Cx-1/p_1) -» Cpr-2(p_1) must be 
k—-2 


the cyclic group of order p generated by g? ~~). Writing « = gf, (mod p*), the conditions 
that 2 mod p” generates (Z/p”Z)* translates into the fact that j is relatively prime to both 
p(p — 1), and hence that x is a primitive root modulo p*. 

This rationalizes the last statement of the theorem, and suggests that it should be used 
to prove the existence of primitive roots. 


Remark 4.6.5: The proof of the first part can be generalized to the fact that all finite fields 
have a primitive root. See Proposition ??{11.1.1{2). 


7 Multiplicative structure of Z/nZ 


sec:mult-structure 


Theorem 4.7.1: 
1. Suppose p ¥ 2 is prime. Then 
(Z[p"Z)* % Cynuyp-1). 
2. For the case p = 2, for n > 2 we have 
(Z/2"Z)* = Cz x Con-2. 
Moreover, (Z/2"Z)* is generated by —1, which has order 2, and 3, which has order 
2”-?. The isomorphism is given by (—1)73° (a, 0). 
3. In general, 


(Z/pt? --- ponZ)* = J] (Z/pp*Z)*. 


Proof. The first follows from existence of primitive roots modulo p”. 
For the second, we show by induction that for every k > 1, 


32" — oh+2 j+l 
for some odd j. This is true for k = 1 as 3? = 8 + 1. Suppose the above holds; then 


= Qe; a 1)? on l= ae a oe) ae ie 


aan 
showing the induction step. 

Note ordgn(3) must divide |(Z/2"Z)*| = 2"~!. The above then shows that ordgn(3) = 
2"? Finally, note that for n > 3, no power of 3 is equal to —1 modulo 2”: if so, then 
by Theorem ??, 32" ° = 32°°42"(3) = —1 (mod 2”). However, 32" ° = 1 (mod 2”~!) by the 
above, so it is not congruent to —1 modulo 2”. 


The last follows from the Chinese Remainder Theorem. 
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8 Wilson’s theorem 

Wilson’s theorem is a multiplicative congruence of a slightly different kind. 

Theorem 4.8.1 (Wilson’s theorem): A positive integer p is prime if and only if 
(p —1)!=-1 (mod p). 


Proof. We may easily verify that the theorem is true for p = 2,3,4. Suppose that p > 5 isa 
prime; consider the set S = {2,3,--- ,p—2}. We will show that for any s € S there exists 
some s’ € S with ss’ =1 mod p. Indeed, given such s we set s’ = s?~?. Then we have that 


ss’ = s?-'=1 mod p. Nowifs’ ¢ S, then we have either s’ = 1 mod por s’=~—1 mod p. 
If s’) = 1 modp, then s = 1 mod p. This is obviously impossible. Similarly, if s’ = — 
mod p, then s = —1 mod p. This is similarly impossible; hence we have s’ € S. Similarly, 


if we have s,t € S with s’ = 7’, then s = t. We may see that ss’ — tt’ = (s—t)s’ =0 mod p. 
As s' £0 mod p, we must have p| s—t. As |s —t| < p, we thus have |s — t| = 0 as desired. 
Finally, s 4 s’; if s = s’, then we have ss’ = s? = 1 mod p, implying that p | s — 1 or 
p|s+1. This cannot be true, as s # +1 mod p; it follows that s # s’. 

Now we are ready to prove Wilson’s theorem. As p is odd, and as |S| = p — 3, there are 
an even number of elements in S. We pair these elements up into disjoint 2-element sets 
{51,81}, 182, 85}, °+> , {8(p-3)/2) 8(p_gy/t- These sets must contain all elements of S exactly 
once. Furthermore, when we take the product s1s/5254 --- 5(p-3) /28(p—3) j2 we will obtain 1, 
as the product of each pair is congruent to 1 modulo p. Hence we have 


(pI )h= 1+ 2+-ep = 1 = 1-878) 5285.-*-S3)28y-3)7°P — LS 1-1+-l==1 modp 


exactly as desired. 


Problem 4.8.2: Let p be a prime of the form 4k +3, and let a1, a2,--- ,a,)-1 be consecutive 
positive integers. Prove that these numbers cannot be partitioned into two sets such that 
the products of the elements of the two sets are equal. 


Proof. Suppose for a contradiction that there do exist sets X = {%1,%2,--:,%m},Y = 
{Y1,Y2;°** Yn} such that the product of the elements of X (denoted P(X )) and the product 
of the elements of Y (denoted P(Y)) are equal. If any of the a; are divisible by p, then 
exactly one of the a; may be divisible by p. In this case we have p dividing exactly one of 
P(X), P(Y), so these products cannot be equal. 

Now if p{ a; fori =1,2,--- ,p—1, then a; =i mod p. Hence 


[P(X)]? = P(X)P(Y) = 2129 +++ @mYiYo** + Yn = U1d2+++p7 =1-2---p—1 mod p. 


But from this we immediately have [P(X)]*? = (p—1)! = —1 mod p; hence we have [P(X)]?+ 
1=0 mod p, so p | [P(X)]? + 1”. As p is of the form 4k + 3, we have thus p | P(X) and 
p | 1. Contradiction; hence these numbers cannot be so partitioned. 
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Problem 4.8.3: Let p be a prime. Prove that the congruence 7? = —1 mod p has a 
solution if and only if p = 2 or p is of the form 4k + 1. 


Proof. If p = 2, the conclusion is clear. If p is of the form 4k/ + 3 and there does exist such 
an x, then we have p| x? +1, so p| x,p| 1. Contradiction; hence there are no solutions for 
p = 4k +3. Now if p = 4k +1, then set U = (2k)!. We claim that U? = —1 mod p. We 
write 


U*? =1-2---(2k)- (2k) -(2k—1)---1 
= 122492) (p= 2h)(—D p= (2k = 1-1) (p11) amod’p 
= 1-2-..(2k)-(2k+1)-(2k42)--- (4k) - (-1)* 
=(p—1)!=-1 modp 


Hence there does exist some « = U with x? = —1 mod p. 


Problem 4.8.4: Determine all positive integers p,m such that 
(p—1)!+1=p”™. 


Proof. Note that if p < 5, then we have the solutions (p,m) = (2,1), (3,1), (5,2). Now 

suppose that p > 5. Then Wilson’s theorem gives the result that p must be a prime. We 

have 2 < (p—1)/2 < p—1, hence (p— 1)? | (p—1)!. Hence (p — 1)? | p™ — 1, so 
m—2 I 


p—-1|p™1++p +---+p+1. It follows from work in previous lectures that p— 1 | m, 
hence m > p—1. Hence 


p™ > pP + >2-2-3-4---(p—2)-(p—1) = 2(p— 1)! > (P— 1)! 41, 


hence there are no solutions for p > 5. Thus the solutions given above are the only such 
p,m. 


Problem 4.8.5: Let p be an odd prime, and let A = {aj, @2,--+ ,@p_1}, B = {b1, ba, ++ , bp-i} 
be complete sets of nonzero residue classes modulo p - that is, if for some n we have p { n, 
then there exist i,j with n = a; = b;. Show that the set {a1b),--- ,@p-1bp_1} is not a 
complete set of nonzero residue classes. 


Proof. We have 
Q109°°+Ap-1 =1-2---p—1=(p—1)!=—-1 modp. 


Similarly, bjb2---b,-1 = —1 mod p. Wilson’s theorem implies that if any set S' is a complete 
set of nonzero residue classes, then the product of all of its elements must be congruent to 
—1 modulo p. But we have 


(a1b1)(a2b2) oie (@-10p-1) = (a1a2 ad Ay—1) (by b9 orig te by-1) = (—1) ‘ (—1) =] mod Pp. 


As p > 2, we have 1 # —1 mod p. Hence {a,bi,--- ,@p-1b,-1} cannot be a complete set of 
nonzero residue classes modulo p. 
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9 Problems 


Some challenging problems on order. 

(ISL 2000/N4) Find all solutions to a” +1| (a+ 1)”. 

(IMO 2000/5) Does there exist an integer n with 2000 prime divisors such that n | 2"+1? 
+variant with squarefree 

(IMO 1999/4) Solve: p prime, x < 2p, 2? | (p—1)* +1. 

(ISL 1997) Let b > 1,m An. If b™—1 and b" — 1 have the same prime divisors then 
b+ 1 is a power of 2. (In fact, stronger thing.) 

(TST 2003/3) Find all ordered triples of primes (p,q,r) such that p |g’ +1, q|r? +1, 
and r | p?+ 1. 

(IMO 2003/6) Prove that for any prime p there is a prime number gq that does not divide 
any of the numbers n? — p with n > 1. 

(MOSP 2007/5.4) Given positive integers a and c and integer 6, prove that there exists 
a positive integer x such that a® + x = b (mod c). 


1. Let p be a prime number. Find all natural numbers n such that p divides y(n) and 
etn) 
p 


such that n divides a» — 1 for all positive integers a relatively prime to n. 


32 


Chapter 5 


Diophantine equations 


Stuff I want to include in this chapter 
1. linear Diophantine equations 
2. quadratic Diophantine equations 


(a) Pell 


(b) root flipping IMO 89/6. TST 02/6. Find in explicit form all ordered pairs of 
positive integers (m,n) such that mn — 1 | m? + n?. 


(c) sum of squares 


(d) sum of 4 squares 
3. techniques: 


a) size comparison, analytical methods 


(a) 
(b) taking modulo. enumerating solutions 
(c) factoring (SFFT) 
(d) 
) 


d 
(e 


infinite descent 


Iurie’s “parameterization” trick. (IMO ??/6: Let a > b > c > d be positive 
integers and suppose 


ac+bd=(b+d+a—c\(b+d—ato). 


Prove that ab+ cd is not prime. 
(f) Constructing solutions 


(g) geometric methods (Minkowski) 
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1 Linear Diophantine Equations 


An equation of the form 
A121 ++++ + Ant, = 5, (5.1) 


where @j,...,@n,b € Z is called linear diophantine equation. 
Theorem 5.1.1: The equation (13.11) is solvable if and only if gcd(a1,...,@n) | 0. 


Proof. Let d = gcd(a,,...,a@,). If d { 6 the equation is not solvable. If d | b we denote 


b 
= —. Then gcd(a},...,a/,) = 1 and the generalized Bézout Lemma says that 


ay 
! 
“49 b »“n 


/ 
i= 
there exist x, such that ajz,+---+a)z), = 1, which implies a;z7, +---+ a,x), = d. We 


obtain a,(b'r) +---+a,(V'2'!,) = b'd=b. 


Corollary 5.1.2: Let a;,a2 be relatively prime integers. If (x?,2$) is a solution to the 
equation 
Q1X 1 + 2X2 = 5, 


then all its solutions are given by 


( 0 
= t 
ns ae Te A 
rq = 3 — at 
Problem 5.1.3: Solve the equation 
15x + 84y = 39. 


Proof. The equation is equivalent to 52 + 28y = 13. A solution is y = 1, x = —3. All 
solutions are of the form x = —3 + 28t, y=1-—5t, t € Z. 


Problem 5.1.4: Solve the equation 

3a + 4y+5z = 6. 
Proof. The equation can be written as 3x + 4y = 6 — 5z, s € Z. A solution of 3x 4+ 4y = 1 
is x = —1, y=1. So a solution of 32 + 4y = 6 — 5s is rp = 5s — 6, yo = 6 — 5s. Hence all 
solutions are 

(. =5s—6+4¢ 

ie =6-—5s — 3t 


For any positive integer a),...,@, with gcd(aj,...,a@,) = 1 denote g(aj,...,a,) to be 
the greatest positive integer N for which the equation 


Q42, +++: +antn = N 
is not solvable in nonnegative integers. The problem of determining g(a1,...,@,) is known 


as the Frobenius coin problem. 
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Problem 5.1.5 (Sylvester, 1884): Let a,b € N and gcd(a,b) = 1. Then g(a,b) = ab—a—b. 


Proof. Suppose N > ab—a-— b. The solutions to the equation ax + by = N are of the form 
(x,y) = (xo + bt, yo — at), t € Z. Let t be an integer such that 0 < yo — at <a—1. Then 
(xo + bt)a = N — (yo —at)b > ab—a—b—(a—1)b = —a. Hence 29 + bt > —1, i.e. ro + bt > 0 
and the equation has a nonnegative solution. Thus g(a,b) < ab—a-—b. 

Now we shall show that the equation 


ax + by =ab—a—b 
is not solvable in nonnegative integers. Otherwise we have 
ab =a(x+1)+b(y+1). 


Since gcd(a,b) = 1, we geta | y+1, b| +1, thusy+1>a,x2+1> 0. We obtain 
ab = a(x +1) + b(y+ 1) = 2ab, a contradiction. 


2 Pythagorean Triples 


A triple (x,y,z) of integers is called Pythagorean if 
ety? = 27, (5.2) 
Theorem 5.2.1: Any solution in positive integers of has the form 
z= (m?—n*)k, y= 2mnk, z= (m?+n?)k 
xz = 2mnk, y =(m?—n*)k, z= (m?+n7)k, 
where 
1. ged(m,n) = 1, ged(z,y) =k. 
2. m,n are of different parity. 
a mn > 0, ko 0: 


Proof. Let gcd(x, y) = k. Then x = ka, y = kb, gcd(a,b) = 1. Then k?(a? + b*) = z?. We 
get k | z and set z = kc. We obtain 
e+hP=c’. 


Suppose that a is an odd number. Then b is even since otherwise c? = a? + b? = 2 
(mod 4), a contradiction. 
Thus c is odd. We have 
b° = (c—a)(c +a), 
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which is equivalent to 


c-act+a c—a 
Note that xc ( 5 5 = 1. Otherwise there exists prime p such that p | re 
c+a Cc-a ct+a 
p | or We get p | oo oe a which implies p | b, a contradiction. Hence 
si ae asf eee 676) 
2 2 2 


and we obtain 
c=m+n?, a=m —n’, b=2mn. 


Problem 5.2.2: Solve in positive integers the equation 


1 Ths, 2ot 
gy? 2 
Proof. The equation is equivalent to 
Per ag Ge y 
u+y ( oe 
tY 


We obtain that z | ry. Hence x? + y? = 17, t = 3° 
Let d = gced(z, y,t). Therefore x = ad, y = bd, t = cd, gcd(a, b,c) = 1. We get 


bd 
a +h? =’, ease 
C 


Hence a,b,c are pairwise relatively prime and we obtain that c | d, which implies d = kc. 
Thus 
xz = kac, y= kbc, t= kc’, z = kab. 


We may assume that 
a=m —n’?, b=2mn, c=m’?4n’? 


and we obtain 


z= k(m*—n‘), y= k2mn(m? +n”), z = k2mn(m? — n’). 
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3 Size comparison and analytical methods 
4 Reducing modulo n 


5 Factoring 


Proposition 5.5.1 (Simon’s favorite factoring trick, SFFT): 
xy + br + ay+ ab =(x+a)(y+)). 
Example. 
Example 5.5.2: Which numbers n can be expressed as the difference of two squares? 


Solution. We wish to solve 


Factor this equation as 
(t+y)(u—y) =n. 
Note that x+y and x —y = (a + y) — 2y are of the same parity. If they are both odd, then 
n is odd; if they are both even, then n is divisible by 4. 
Conversly, if n is odd or n is divisible by 4, then we can write n = ab where a,b are 
factors of n having the same parity. We wish to havea = x+y and b=2-—y so set 


a+b 
xL= 
2 
= 
Y= 2 


Note these are integers by the assumption on a and b. 


6 Problems 


(Analysis) (ISL 2004) Let b > 5 be an integer and define 


ty = (11...122...25)p. 


n-1 n 
Prove that x, is a perfect square for all sufficiently large n if and only if b = 10. 
Looking at prime divisors modulo stuff: (ISL 2006/N5) Find all pairs (x,y) of integers 
satisfying the equation 
xi —1 


5 
ae 
el 


(TST ??) Prove that for no integer n is n’ + 7 a perfect square. 
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Quadratic residues 


1 Quadratic residues 


Definition 6.1.1: Let p > 2 be a prime and a an integer. The Legendre symbol (2) is 
defined as follows. 


(1, if a is a square modulo p and p{ a 
a 
(=) = { —1, ifais not a square modulo p (6.1) 
(9, if pla. 


Note (2) is pronounced “a on p.” If a is a square modulo p we also say a is a quadratic 
residue modulo p. 


Note that (2) depends only on the residue of a modulo p, so we may think of (2) as a 
map 


(*) :Z/pZ + {£1}. 


The following offers a theoretical, although impractical, way to calculate (2). 


Lemma 6.1.2: For p > 2, 


(<) Sq (mod p). 


Note this gives the actual value of ( ) since —1 £1 (mod p). 


D 
Proof. The lemma clearly holds for a = 0 (mod p). Now suppose a # 0 (mod p). Note that 
a’z = +1, since (a°z)? = 1 (mod p) by Fermat’s Little Theorem. 
First suppose a is a square modulo p. Write a = b”? (mod p). Then 


az =? 1=1 (mod p) 
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by Fermat’s Little Theorem. 
Now suppose a2 = 1. Let g be a primitive root modulo p. Then we can write a = g* 
(mod p) for some integer k. The hypothesis gives 


k(p—1) pal 
2 


=a? =1 (mod p). 


Since g is a primitive root, this implies p— 1| Me) 


modulo p. ! 


: : k : 
, ie. kis even. Then a = (b2)? is a square 


As a corollary of the preceding lemma, we obtain the following multiplicative property. 


Proposition 6.1.3: For any integers a and b and any prime p > 2, 
Geol 
p Pp) \p/- 


(2): @/pay* - {1} 


In other words, 


is a group homomorphism. 


Proof. By Lemma |6.1.2} 


(8) = t= (2)(8) 


The second statement follows from the first and the fact that (4) el 


This means that to calculate (2), we can factor a into primes 


and find that a 7 
= eo) 
Pp Pp p/ ? 


so it remains to find an easy way to evaluate ; where both p and q are prime. We do this in 
the next section. 


‘Alternatively, we can avoid the use of primitive roots as follows. In the first part we’ve shown that 


{ala°® =1 (mod p)} C (Z/pZ)”?. 


The set on the LHS has pot elements. Indeed, z?~! — 1 = 0 splits completely modulo p and has distinct 


roots, namely 1,...,p—1 by Fermat’s little theorem. Then a 1, as a factor of z?~! — 1, must have po 
distinct roots. 
It suffices to show the set on the RHS has at most pot elements. This is true since for every a, a? and 


2 
(—a)? are equal. Hence there are at most bet nonzero squares modulo p, namely 1?,..., (25+) . 
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2 Quadratic reciprocity 


Quadratic reciprocity relates (2) with (2), i.e., it gives a relationship between whether p is 


a square modulo q and whether g is a square modulo p. (See example. ADD.) 


Theorem 6.2.1 (Quadratic reciprocity): Let p 4 q be odd primes. Then 
2) (2) a(n 
(" Pp oe 


(_(p p=q=3 (mod4 
(2) () (mod 4) 


(2) ' otherwise. 


In other words, 


For the prime 2, or when p = —1, we use the following instead. 


Theorem 6.2.2 (Complement to quadratic reciprocity): Let p be an odd prime. Then 


In other words, 
(=) 7 Cy p=1 (mod 4) 
a \-1, p=3 (mod 4). 
e (1 =p =+1 (mod 8) 
\-1, p=+3 (mod 8). 


We know (4) = qh mod p) by Lemma |6.1.2| To prove quadratic reciprocity, we first 
P ss 


find an alternate way to express gq? . 


Lemma 6.2.3 (Gauss’s lemma): For an integer a and an odd prime p, define the least 


residue of a modulo p, denoted LR,(a), to be the element b € (—8, b) such that 
a=b (mod p). 


(In other words, LR,(a) is the integer of smallest absolute value congruent to a modulo p.) 
Let ys be the number of elements of { kal 1 < k < 5+} such that LR,(a) < 0. Then 


a2 =(-1)" (mod p) 


)=cn 


Al 


Hence, 
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Proof. We calculate the product 


modulo p in two ways. 
First, combining powers of a we get 


a-2a-- (P= a) =a (A=): (mod a). (6.2) 


Secondly, reducing each factor to the least residue first gives 


= i 
cael Mannion LR, (75 ; ) 


2 
—1 
= (-1)"|LR,(a |---[LR, (2 ; )| 
-—1 
= (-1) (=) (mod 7). (6.3) 
In the last step we used the fact that |LR,(a)|, . » (24) | is a permutation of 1,..., +. 
To see this, note —-LR,(m) = LR,(—m), so 
—1 
{+LRp(ka) 12ERz o—} = {+LR,(ka):1<k<p—1} 
2 {PRs PE}. 
2 2 
Hence {+LR,(ka) oh = pot} must contain one element from each pair +1,..., eS, as 


needed. 


Equating (6.2) and (6.3) and cancelling (2+)! gives the desired result. 
—l 


The second statement follows because 1 4 —1 (mod p). 


Now we prove quadratic reciprocity. 


Proof of Theorem The strategy is as follows. 


1. Establish a correspondence between x € (0,4) such that LR,(xq) < 0, with lattice 
points in a certain region (6.10). Similarly establish such a correspondence with y € 
(0, zB) such that LR,(yp) < 0. By Lemma 6.2.3} (4) (2) is the total number of lattice 

points in this region. 


2. Pair up the points in the region. We will find that there is an odd point out exactly 
when p = q=3 (mod 4). 
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Let 


By Lemma [6.2.3] 
— (2) (2) = (ymca = eae. (644) 


We would like to know the parity of 4+ [e. 


Claim 6.2.4: There is a bijection between integers x € (0, 1) satisfying LR,(ap) < 0 and 
lattice points (x, y) satisfying 


1 
O<r< 1 (6.5) 

1 
O<y< p (6.6) 
= <p = 77 <i), (6.7) 


Proof. If (x,y) satisfies the above inequalities, then inequality (6.7) gives that the the least 
residue of xp is in (-¢, 0). 


Conversely, given such a x, choose y so that yq is the closest multiple of q to xp. Then 
LR,(xp) = xp —yq, so inequality (6.7) follows. Moreover, this is the only value of y that will 


satisfy (6.7). Then follows since gives 


Pp Dog + Po, |. Pes 
0 =o ay < <3 = ; 
ra y 7 5° 4 + 5 5 
Note (6.7) is equivalent to 
Pp P44 
—~1<y<-"%+-. (6.8) 
q q 2 


Applying the claim with p and q switched, and x and y switched, inequality (6.7) becomes 
—§ < yq — xp < 0, which rearranges to 


Dp Dp Dp 
—~—-— <y<-2. (6.9) 
q 2q q 
Noting that there are no points on the line y = tr in the following region, we see that fu; + pe 
equals the number of lattice points in the region R defined by 


1 
ee ae 
2 
1 
0<y<P (6.10) 
Pp Pp Pp 1 
— a a a ey 
- Dg y ae 
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This region is symmetric around the point (e, pit). Indeed, making the change of variables 


a =x — 4 andy’ =y- 2, we get 


1 1 
BIT of 20% 


a a 
i (Bei) whe (e4) 
Hence we can pair up the lattice poms in R by matching (x,y) with (- is pet _ y) 


(this corresponds to (2’,y’) © (—2’, —y’)). The only point which would not be pated up is 


(4, pit), but this is an integer if aad only if p= q =3 (mod 4). Thus py + pg is odd iff 


p =q=3 (mod 4). In light of (6.4 (6.4), this proves the theorem. 


Proof of Theorem|6.2.4 The fact that (=) = les comes directly from Proposition|6.1.2 
To calculate (2), we can use Lemma directly. In this case yw is the number of 


elements in the set {2,4,...,p — 1} in the interval (2,p). By casework, this is even when 
p = +1 (mod 8) and odd when p = +3 (mod 8). 


Problems 


1. (IMO 1996/4) The positive integers a, b are such that 15a+16b and 16a — 15b are both 
squares of positive integers. What is the least possible value that can be taken by the 
smaller of these two squares? 


3. Jacobi symbol 
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Continued fractions 


1 Farey fractions 
2 Continued fractions 


3 Infinite continued fractions 


To evaluate a purely periodic continued fraction (@,..., Gn), we need to solve 


= Ges at) 


We know we can write the right-hand side as oath for some integers (in fact, nonnegative 


integers) a,b,c,d. What are those integers? We will calculate a recursive formula for them. 
Writing fi(z) = a; + 4, we would like to calculate fo(fi(--+ fn(xz)-++)). Note we are 

. , . _ axtb . 
repeatedly applying rational functions of the form f(x) = © vq: There is an easy way to 


repeatly apply these types of functions, using matrices. 
Note that 
6 : (") _artby _ a(5)+? 
ed - dy ¢(z 
y) cx+dy  e(#)+d 


()0)-@) = 18)-5 rs 


This immediately gives us the following. 


sO 


Proposition 7.3.1: We have 


=e An 1 ao 1 c\ a! 
(a0, 44n,2) = 5 where (4 ieee | Gee: 


More explicitly, 
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where the h,,, k, are defined by the recurrence 


hag =0 ha =1 y= aya + hy-a 

k_» =0 ky4=1 ky = ghia + kya: 
Proof. (Note that the explicit formula can also be proved directly, using induction. To make 
the induction step, note (ao,...,@n,2) = (ao,---,@n + 4). 
4 Rational approximations 


5 


Problems 


Problems 1-7 are taken from the ARML Power Round 2007. 


dL 


(a) Let K be a positive integer greater than 1. Prove that the number of purely 
periodic numbers with period 1 that are less than K is K — 1. 


(b) Prove that there are infinitely many purely periodic numbers with period 2 that 
are less than 2. 


. Is 3— V6 purely periodic? What about 2+ 6? If so, give a representation; if not, 


explain why not. 


eS (a,b) and y = (ba), what is a: 


. Assume that x = (@,..-,@m—1) and y = (G1, -.-, Gp) are two purely periodic numbers 


of period k. Show that if x satisfies Ax?+_Bxr+C = 0, then y satisfies Cy?— By+A = 0. 


. If x is purely periodic, can 2 be purely periodic? If yes, give an example; if not, give 


a proof. 


. Suppose D and E are positive integers. Let = D+ VE and = = D— VE. Prove 


that if x is purely periodic, then —1 < ¥ < 0. Is the converse true? 


. Suppose that n is a positive integer that is not a perfect square. 


(a) Show that there is at most one integer f(n) so that f(n) + /n is purely periodic. 
Is there exactly one? What is the function f(n)? 


(b) For what values of n is 1+ .4/n purely periodic? Find a purely periodic represen- 
tation of 1+ ./n for each of these values. 


. Two players A and B alternately take chips from two piles with a and 6 chips, respec- 


tively. A move consists in taking a multiple of the other pile from a pile. The winner 
is the one who takes the last chip in one of the piles. Find the conditions on a and b 
in which the first player has a winning strategy. Describe the strategy? (This is 12.34 


in [Eng98].) 
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Field and Galois Theory 


Chapter 8 


Unique factorization 


1 Unique factorization domains 


MOTIVATION 
First, we define what exactly unique factorization means. Let R be an integral domain. 


Definition 8.1.1: An element a € R is irreducible if it is not a unit, and its only factors 
are units and associates. A unit is an invertible element in R, while an associate of a is a 
unit times a. 


For the positive integers we often just say a is irreducible if a 4 1, and its only factors 
are 1 and itself. However, if we work with the integers, then there will also be the factors 
—1 and —a, and we don’t want to view these as different. For example, 5 is irreducible over 
the integers because its only factors are units, +1, and associates, +5. 


Definition 8.1.2: A unique factorization domain (UFD) is a integral domain where 
factoring terminates and every nonzero, nonunit element factors uniquely into irreducible 
elements. That is, if 

Q=Pi---Pm=N-+-Gns 
and p1,---,Pm;Q1;--+;n are irreducible elements, then m = n and we can reorder the q;’s 
so that p; is an associate of q;, for each 2. 


For example, we regard 6 = 2-3 = —2-—3 as the same factorization. 
Unique factorization doesn’t hold for all domains—for example, consider Z[,/—5], that 
is, numbers of the form a+ b\/—5. Then 


(1+ /-5)(1 — /—-5) =6 =2-3 


are two factorizations of 6 into irreducible elements. 

The notion of a prime is related to that of an irreducible element. People use them as 
synonyms in elementary math—because they coincide for the integers—but the distinction 
between them will be quite important for us. 
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Definition 8.1.3: A prime in R is an element p, not a unit, such that if plab then pla or 
plo. 


This tells us that if a prime p divides a, then no matter how we factor a, we can’t 
avoid p dividing one element of a. The connection between primes, irreducibles, and unique 
factorization is given by the following. 


Lemma 8.1.4: If R is a ring where factoring terminates, and every irreducible element is 
prime, then R is a UFD. Conversely, in a UFD, every irreducible element is prime. 


Proof. Suppose a = p,...Pm = --+-Q are two factorizations into irreducible elements. 
Since p, is irreducible, it is prime, and hence must divide one of the q;. Since q; is irreducible, 
its only factors are units and associates, so p; must be associated with g;. Then we can cancel 
them, leaving a unit. Repeating this process, every factor in the left factorization is paired 
with one in the right factorization. 

For the converse, suppose p is irreducible and p|ab. Then pd = ab for some d. Factoring 
a, b, and d shows that p must divide one of the factors of a or b by unique factorizaton. 


(Note that primes are always irreducible, because if p = ab were a proper factorization, 
then p{ a and p{b.) 
The main strategy for proving unique factorization is the following. 


1. Show that the ring R in question (here, K[]) admits division with remainder, with 
some measure of size so that the remainder is smaller than the quotient. 


2. Show that if we have division of remainder, then greatest common divisors exist, 
and moreover that they have the nice property given by Bézout’s Theorem. 


3. Show that this implies that all irreducible elements are prime, and hence R is a UFD. 


The advantage of such an abstract approach lies in the fact that it works for a variety of 
different number systems. In particular, once we’ve shown items 2 and 3, then given any 
ring, we only have to show that we can have division with remainder, and it will follow 
that it is a UFD. This simultaneously shows unique factorization for Z, K|a], and even 
Z{i] = {a + bila, b € Z}." 

In the language of abstract algebra, the above steps are phrased as follows: 


1. Ris an Euclidean domain. 
2. An Euclidean domain is a principal ideal domain. 
3. A principal ideal domain is a unique factorization domain. 


We now carry out this program. 


'The converse is not true; a UFD is not necessarily a PID or Euclidean domain. For example Z| 
is a UFD but not an Euclidean domain. 


if v= 163) 
2 
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1.1 Step 1: Euclidean domains 


Definition 8.1.5: An integral domain R is an Euclidean domain if there is a function 
|- |: R — No (called the norm) such that the following hold. 


1. |a| = 0 if a=0. 
2. For any nonzero a,b € R there exist g,r € R such that b = aq +r and |r| < |al. 


Note that both the integers Z and Kx] are Euclidean domains. The norm on Z is simply 
the absolute value, while the norm on K[z] is the degree of the polynomial. Theorem [9.2.1] 
shows that A [2] is an Euclidean domain. 


1.2 Step 2: Euclidean domain = PID 


We'd like to prove Bézout’s Theorem for an Euclidean domain, that given a,b in R there 
exists a greatest common divisor g and s,t so that as + bt = g. Rather than thinking of this 
as an equation in variables s,t, we can think of it as an equation in sets (a) and (b), where 
(x) denotes the set of multiples of x. For two sets S,T we define S+T = {s+t|s € S,t € T}; 
then it turns out what we want is 


(See Lemma below.) 


Definition 8.1.6: An ideal in a ring R is a subset J such that if a,b € J then ra,a+beTl 
for any r € R. A principal ideal is an ideal generated by one element, that is, there is a a 
such that J = {ralr € R}. We write I = (a). 

A principal ideal domain (PID) is a integral domain where every ideal is principal. 


Theorem 8.1.7: An Euclidean domain is a PID. 


Proof. Let R be an Euclidean domain, J C R and ideal, and b be the nonzero element of 
smallest norm in J. Suppose a € J. Then we can write a = qgb+r with 0 <r < |b], but since 
b has minimal nonzero norm, r = 0 and bla. Thus I = (0) is principal. 


Lemma 8.1.8: A PID satisfies Bézout’s Theorem. 


Proof. Let R be a PID. Since every ideal in FR is principal, for every a,b (not both 0) we 
have (a) + (b) = (d) for some d € R. (Note the sum of two ideals is an ideal—check this for 
yourself.) This says there exist s,t € R such that 


as + bt = d. 


From this, any divisor of a,b must divide d. Furthermore, d must divide both a and b since 
a =a+0 and b=0+0are both in (a) + (b) = (d). In other words, d is the greatest common 
divisor of a, b. 
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1.3 Step 3: PID= UFD 
Theorem 8.1.9: A PID is a UFD. 


Proof. Suppose p is irreducible; we show p is prime. Suppose plab but p does not divide 
a. Then using Bezout’s Theorem and the fact that a and p are relatively prime, we get 
as + pt = 1 for some s,t. Multiply by 6 to get 


abs + ptb = b. 


Since plablabs, p|ptb, we have p|b. This shows that irreducible elements are prime in Z. 
It remains to show factoring terminates.” Otherwise, there would be an infinite sequence 


of nonassociated elements a, da2,... € Rsuch that aj,,|a;. Then (a1) C (a2) C ---. However, 
Ujs1(@i) is an ideal, so it is principal, say generated by b. Then b € (a;) for some 2; this 
implies that (b) = (a;). Hence (a;) = (ai41) = +++, a contradiction. 


Since irreducible elements are prime and every nonzero element of R factors into irre- 


ducibles, R is a UFD. 


Corollary 8.1.10: Z and K[z] are UFDs. 


2 Example: 77+ 7?=n 


Theorem 8.2.1: Let n be a positive integer. Then the equation 


has a solution in integers iff every prime p = 3 (mod 4) appears in n with even exponent. 
If n = 2%? .- - pre gh --- qe" where p; and q; are primes congruent to 1, 3 modulo 4, then 
the equation x? + y? = n has 
4(by +. 1)--- (bp +1) 


solutions in integers. 


Proof. Each solution to x? + y? = n corresponds to a factoring (x + yi)(x — yi) = n over the 
Gaussian integers Z[i]. Thus the number of solutions is the number of z such that 27 = n, 
or 4 times the number of nonassociated z € Z|i] such that zz = n. (Two Gaussian numbers 
are associated if they differ by a unit +1, +7, so r+ yi, —y+21, —x — yt, y— x1 are considered 
the same.) 

Now factor n = 2%"! --- pr gi’ +: go” where p; and q,; are primes congruent to 1, 3 modulo 
4, respectively. From knowledge of factoring in Z[i] we know that 


1. 2 ramifies in Z[7], that is, it is the product of two associated primes 1 + i, 1 —1. 


?This argument is not needed for our purposes: Both Z and K[z] are Euclidean domains, and factoring 
must terminate for them because factors always have smaller norm (absolute value and degree, respectively). 
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2. The p; = 1 (mod 4) split, that is, p; = z;Z; where z is prime in Z[i] and not associated 
to Z. 


3. The g; =3 (mod 4) remain prime. 


Now if z7 = n and a Gaussian prime divides z, then its conjugate must divide z. Thus, 
since we have unique factorization in Z|i], each such z, up to multiplication by associates, 
corresponds to a way of splitting the prime factors of n into complex conjugate pairs. We 
note the following: 


1. The factors gq; are their own conjugates, so z and Z must each get qj /? TE one of the 
c; is odd there is no solution. So we suppose they are all even. 


2. It doesn’t matter how the prime factors of 2% are split since they are all associates. 


3. There are b; +1 ways to split the factors of q;’ , since we can have either a or a) Fe 


or z; divide z. Thus there are (b; + 1)--- (bg, +1) solutions to zZ = n up to associates. 


A similar argument works for the equations x? + 2y? =n and x74+ xy+y? =n. 


3 Problems 


1. (The power of ideals) We rephrase some earlier results that used the “division with 


remainder arguement” in terms of ideals. 


(a) Let n > 1 and a be relatively prime to n. Show that 
{m:a™=1 (mod n)} 
is an ideal. 


(b) Conclude Proposition 


2. For which n does x? + 2y? = n have a solution? How many solutions are there? How 
about 2? + xy + y? =n? 
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Chapter 9 


Polynomials 


1 Gauss’s argument 


We’ve shown that A’[z] is a UFD, but the argument above does not show that Z[z] is a 
UFD, because division with remainder fails for Z[x]. We will need a further argument. The 
basic idea is that a polynomial factors in Z|x] the same way it does in Q[z], except with its 
factors adjusted by constants so the coefficients are in Z. 

Let R be a UFD and let K be the field of fractions of R. That is, AK consists of the 


numbers ¢ where a,b € R and b # 0, and we say — = § iff ad = be. For example, Q is the 


field of fractions for Z. 


Definition 9.1.1: A nonzero polynomial f € R[x] is said to be primitive if all its coeffi- 
cients do not have a common proper divisor; equivalently, there does not exist a prime p € R 
such that p|f. 


Lemma 9.1.2: If R is an integral domain, then so is R[z]. 


Proof. Take any p,q € R[x] not equal to 0. We can write 
p=) a2", am #0 
i=0 
— S> bx", by 0 
i=0 


Then the leading coefficient of pg is dmb,x2"*". It is nonzero because since R is an integral 
domain, dm, b, #0 imply that a,b, 4 0. Hence pg £ 0. This shows that R[x] is an integral 
domain. 


Lemma 9.1.3 (Gauss’s lemma): (A) An element of R is prime in R[x] iff it is a prime in 
R. Hence if a prime p of R divides a product fg of polynomials in R[x], then p|f or plg. 
(B) The product of primitive polynomials in R[z] is primitive. 
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Proof. If p € R is nonzero, and a prime in Riz], then it is a prime in the subring R. 

Conversely, let p be any prime element in R. Then R/(p) is an integral domain 
lemma 9.1.2} R/(p)|a] is an integral domain. 

Suppose p|fg for f,g € Riz]. Then in R/(p)[z], fg = fg = 0. Since R/(p)[z] is an 
integral domain, either f = 0 or g = 0. In other words, either p|f or plg in R[x]. Thus p is 
a prime in R[z]. 

If f,g are primitive, then p{ f and p{ g for all primes p € R. Since p is also prime in 
Riz], p{ fg. Hence fg is not divisible by any prime in R, and it is primitive. 


' so by 


Lemma 9.1.4: Every nonconstant polynomial f € K|z| can be written uniquely (up to 
multiplication by units) in the form f = cfo, where c € K and fo is a primitive polynomial 
in R[z]. 

Proof. Each coefficient a; of f is in the form a where p;,q; € R. We can find a nonzero 


t € R such that t is divisible by each denominator (for, example, take t to be the product of 
the denominators). Then we can write 


tf=fi, 


where f, € Riz]. Let s € R be a greatest common divisor of the coefficients of f,;. Then we 
have 
S 
f=" fo 
in K |x] where fo € R[x] and the coefficients of fp have no common divisor. This gives the 
desired representation. 


Next we check uniqueness. Suppose 
ii = Cfo = Clas 


where c,c € K and fo, ff € R[x] are primitive. Multiply by an element of R to “clear 
denominators,” to reduce to the case where c,c’ € R. Now take any prime plc. Since p is 
prime in R[x], pic’ or p| fp. The second is impossible since fj is primitive. Hence plc’, and 
we can cancel p. Continuing in this way, we get that c and c’ share the same prime factors 
with the same multiplicities. Hence c,c’ are associates. 


Lemma 9.1.5: Let fo be a primitive polynomial and let g € R[z]. If folg in K [a] then folg 
in R[z]. 


‘Tf I is an ideal, then R/I is the quotient ring: Two elements a,b in R are considered to be the same in 
R/T if they differ by an element in J. Keep in mind the example R = Z; then R/(p) is simply the integers 
modulo p. 

Now R/(p) is an integral domain, because if ab = 0 in R/(p), then ab € (p), i.e. p divides one of a,b. But 
since p is prime either pla or p|b, which translates back into a = 0 or b= 0 in R/(p). 
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Proof. If folg in K [a], then we can write g = foh where h € K |x]. We need to show h € Riz]. 
By lemma 9.1.4] we can write h = cho, where c € K and ho is primitive. Then g = cfoho. 
By lemma/9.1.3} the product foho of primitive polynomials is primitive. We can write c = 5, 
where s,t € R have no common factors. If a prime p in R divides the denominator ¢t then 
p {sso p|foho, contradicting the fact that foho is primitive. Hence t is a unit, and c € R. 
Then h = cho € R[x], so folg in Riz]. 


Lemma 9.1.6: Let f be a nonzero element of Riaz]. Then f is an irreducible element of 
Ria] iff it is an irreducible element of R or a primitive irreducible polynomial in K[z]. 


Proof. If f € R, then the only factors of f in R[x] are in R, so f is irreducible in R iff it is 
irreducible in R[x]. This proves the lemma for f € R. Now suppose f ¢ R. 

If f € R[x] is a primitive polynomial irreducible in K’[z], then it is irreducible in R{z]. 

If f € Rl] is not primitive, then it is reducible in R[x]. Thus it suffices to show if 
f € Ria] is reducible in K[z], then it is reducible in Riz]. Suppose f € Rix], and f = gh 
is a proper factorization of f in K[a]. We can write g = cgo,h = Cho where c,c’ € K and 
go, ho are primitive. Since go and hg are both primitive, so is ggho. Then f = cc’(goho), so by 
uniqueness in lemma 9.1.4} cc’ must be in R (and is the gcd of the coefficients of f). Thus 
f = (cc’)goho is a proper factorization of f in R[x] as well, as needed. 


Theorem 9.1.7: The ring R{2] is a unique factorization domain. 


Proof. It suffices to show that every irreducible element f of R[x] is a prime element, and that 
factoring terminates. By Lemma 9.1.6} f is either irreducible in R or a primitive irreducible 
polynomial in K [x]. In the first case f is prime in R (R is a UFD) and hence prime in R[z] 
by Lemma 


In the second case, f is primitive irreducible in K[z], thus a prime in K[z], since [2] 
is a UFD. Hence fg or f|h in K[a]. By Lemma |9.1.5] flg or f|h in R[x]. This shows f is 
prime. 

A polynomial f € R[x] can only be the product of at most deg(f) many polynomials p; 
of positive degree in R|a] because the sum of the degrees of the p; must equal deg(f). Factor 
terminates for the factors of f in R because factoring terminates in the UFD R, and the 
primes in R dividing f are the primes dividing every coefficient of f. 


Hence R[x] is a UFD. 


Corollary 9.1.8: Z|z] is a UFD. 
If Risa UFD then R[x,...,2n] is a UFD. 


Proof. Since Z is a UFD, so is Z[x]. The second statement follows from Theorem [9.1.7] by 
induction. 
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1.1 More Proofs 


Theorem 9.1.9 (Chinese Remainder Theorem): If polynomials Q1,...,Q, € K[z] are pair- 
wise relatively prime, then the system P = R; (mod Q;),1 < i <n has a unique solution 
modulo Q1---Qn. 


Proof. Let Q = Q1---Qn. Note Q; and é are relatively prime. Hence by Bézout’s Theorem 
there exist f; and g; so that 
Q 


a 


fiQe 9; 
Now 


(1 a afi) Ri = Rise, 


is congruent to R; modulo @;, and zero modulo Q, for 7 #7. Hence 


n 


P= - af) Ri 
i=1 
is the desired polynomial. 
For uniqueness, suppose P; and P, satisfy the conditions of the problem. Then P; — Py, is 
zero modulo Qj. Since the Q; are pairwise relatively prime, P,— P, = 0 (mod Q,--: Qn). 


Theorem 9.1.10 (Rational Roots Theorem): Suppose that R is a UFD and K its fraction 
field. (For instance, take R = Z and kK = Q.) Suppose f(z) = ana" +--+ + a9 € Rix] and 
an #0. Then all roots of f in K are in the form 


factor of ao 
factor of an 


In particular, if a, = +1, then all roots of f in K are actually in R. 


r 


Proof. Write x = = in simplest terms. Then multiplying through by s” gives 


an (“)" aaa a (“) + ag = 0 


as ~2 -1 
dgr” ==s(dy ar? + bars”? + aps” *). 


Since s and r have no common factor, s must divide a,. (This uses the fact that R is a 
UFD—how?). Rewriting as 


aps” = —Tlayr’ hays”) 


makes it clear r divides do. 


Remark 9.1.11: In particular, if a, = 1, then all roots of f in K are in R. A ring is said 
to be normal if whenever t € K is a root of a monic polynomial in R[x], then t € R. Thus 
the above shows that UFDs are normal. 
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1.2. Problems 


1. (Bézout bound) Let f(x,y), g(x,y) € Clz,y]. Prove that either f,g have a constant 
nonzero factor, or they have finitely many zeros (x,y) in common. (Hard: They have 
at most deg(f) deg(g) common zeros.) 


2. For a field K, let K (a) be the field of rational functions, that is, 


K(x) = {2 [page Kl 


Let f and g be rational functions such that f(g(x)) = ax. Prove that f and g are both 


in the form ae with ad F be. 


In many ways, polynomials are similar to integers. Like integers, polynomials admit 
division with remainder, existence of greatest common divisors, and unique factorization. 


2 Main Theorems 


In this section K will stand for C (the complex numbers), R (the real numbers), Q (the 
rational numbers), or Z/pZ (the integers modulo p), while R will stand for any one of the 
before sets or Z (the integers). Note that the sets we label with Kk all have multiplicative 
inverses, i.e. are fields. 

Our first result is that when we divide polynomials, we can be assured to get a remainder 
with degree smaller than our divisor. 


Theorem 9.2.1 (Division with remainder): If f,g € Az], then there exist polynomials 
q,r € K|z] such that degr < deg g and 


fgg: 


If f,g € Zia] and g is monic, then there exist g,r € Za] such that degr < deg g and 


f=9q+r. 


Proof. This is the division algorithm familiar from high school algebra class. Namely, if f 
has leading term ax” and g has leading term br™ with n > m, then f — ¢2"~™ has degree 
less than f. Thus we can keep subtracting multiples of g from f until the result has degree 
less than deg g. 

If g is monic, then b = 1 so at each stage we subtracted an integer polynomial multiple 
of g, and both the quotient g and the remainder r will have integer coefficients. 


Theorem 9.2.2 (Bézout): Given f,g € R[x], there exists a polynomial h, called the great- 
est common divisor and denoted gcd(f, g), such that the following hold: 


1. h divides both f and g. 
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2. If p divides both f and g then p divides h. 
Let f,g € K[z]. There exist polynomials u,v € K[z] so that uf + vg = gced(f, 9). 


(Note that h is only determined up to a unit. We’ll “sweep this under the rug” and allow 
any choice of h up to that constant.) 

To calculate the gcd, we often use the Euclidean algorithm. Given polynomials f and g, 
for any polynomial g we have 


gcd(f,g) = gcd(g, f — 9g). 


Supposing deg f > deg g, take q so that f — qg =r has degree less than g, as in the division 
algorithm; this reduces the degree of f. Repeating this process decreases the degrees of the 
polynomials; we eventually get to gcd(h,0) in which case the answer is seen to be h. 


Theorem 9.2.3 (Unique factorization): Every polynomial in R[x] factors uniquely in R[z], 
up to constants. In fact, every polynomial in R[x,...,2,] factors uniquely in R[r,...,2n], 
up to constants. 


We give two more useful results. 


Theorem 9.2.4 (Chinese Remainder Theorem): If polynomials Qi,...,Qn € [a] are pair- 
wise relatively prime, then the system P = R; (mod Q;),1 < i <n has a unique solution 
modulo Q;---Qn. 


Theorem 9.2.5 (Rational Roots Theorem): Suppose f(x) = a,7"+-+--++ a9 is a polynomial 
with integer coefficients and with a, #0. Then all rational roots of f are in the form 


factor of ap 
factor of ap. 


In particular, if a, = +1, then all rational roots of f are integers. 
Here’s a cute application of Bézout’s Theorem: 


Example 9.2.6: Let f,g be polynomials with integer coefficients and with no common 
factor. Prove that gcd(f(n), g(n)),n € Z can only attain a finite number of values. 


Solution. By Bézout’s Theorem, we have u(x) f(x) + v(a%)g(x) = 1 for some u,v € Q/z] 
and nonzero. Clearing denominators of u and v, we get u/(x) f(x) + u'(x)g(x) = k for some 
u’,v’ € Zia] and nonzero k € Z. Hence ged(f(n), g(n)) | k. 


2.1 Problems 


1. [1] Show by example we cannot always carry out division with remainder in Z[z] and 
that Bézout’s Theorem does not hold for Z[z]. 
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2. 


10. 


EL. 


[1] Compute the greatest common divisors in Z[z]: 


(a) ged(x® — 2° — 2? + 1,2? — 277 + 2x — 1). 
(b) ged(a!* — 1,28 + 1). 


. Find the greatest common divisor in Z|z]: 


(a) [2] ged(x™ — 1,2 — 1). 
(b) [2.5] gced(e#” + 1,2” + 1). 


Are your answers the same if we work in (Z/pZ)|z|? 


. [1.5] Let n > 0 be an integer. Find the remainder upon division of x” + 2"! +---+1 
by: 
(a) 2? +1. 
(b) 227+a+1. 


(c) x —a#+1. 


. [2.5] Let f,g be relatively prime polynomials with integer coefficients. Prove that there 


exist nonzero polynomials u,v with integer coefficients such that uf + vg = k where k 
is a nonzero integer. 


Suppose that uw) f+vig = ko and uy, v are integer polynomials with u, = 0”, a;2",v = 
og bx", deg(ui) < deg(g), gcd(ao,.--,@m,bo,---,bn) = 1. Prove that ko | k. 


- [3] Let f: Q— Q satisfy f(f(f(a))) + 2f(F(@)) + fle) = 4e. and f(f(--- f(@))) = & 


where f is taken 2009 times. Prove that f(x) = x. 


3] (BAMO 2004) Find all polynomials f with integer coefficients taking irrationals to 
irrationals. 


5] (USAMO 1997/3) Prove that for any integer n, there exists a unique polynomial Q 
with coefficients in {0,1,...,9} such that Q(—2) = Q(—5) =n. 


n° +1000 
n—10 


2| For how many integers n is an integer? 


iw) 


Suppose that f and g are integer polynomials such that f(n)/g(n) is an integer for 
infinitely many n € Z. Show that as polynomials, g(x) divides f(z). 


5] (IMO 2002/3) Find all pairs of integers m > 2,n > 2 such that there are infinitely 
many positive integers a for which a” + a? — 1 divides a” +a —1. 
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3 Arithmetic Properties 


In this section we concentrate on polynomials with integer coefficients. The following is a 
simple but very useful idea. 


Theorem 9.3.1: If P has integer coefficients, then a — b | P(a) — P(b) for all integers a, b. 
Proof. Let m=a—b. Then a=b (mod m). Let P= cpr" +--+ +c," + co. Then 
Cn” + +++ + Ca+ Co =Cpb" +---+c1b+¢ 9 (mod m) 


giving P(a) = P(b) (mod m), as needed. 


Here is a typical application. Note the use of the extremal principle. 


Example 9.3.2 (USAMO 1974/1): P(x) is a polynomial with integral coefficients. If a, b,c 
are integers so that P(a) = b, P(b) =c, P(c) =a, prove that a=b=c. 


Proof. If not, then no two are equal. Without loss of generality, assume that c is between a 
and 6. Then 
|P(a) — P(®)| = |e— 8] < |b—al. 


However, b — a | P(b) — P(a), a contradiction. 


Example 9.3.3: Let P be a nonconstant polynomial with integer coefficients. Prove that 
there is an integer x so that P(x) is composite. 


Proof. Take n so that P(n) is nonzero. Suppose it is prime. For all k € Z, we have 
P(n) | P(n+kP(n)) — P(n), and hence P(n) | P(n+kP(n)) . If P(x) is not composite for 
any integer x, then P(n+kP(n)) is +P(n) or 0 for all k € Z. P attains one of these values 
infinitely many times, so must be constant, a contradiction. 


One question we could ask is what values a polynomial can take modulo a given integer 
m as x ranges over the residues modulo m. (From Theorem we know that the value 
modulo m depends only on x modulo m.) We know by the Lagrange Interpolation formula 
that we can manufacture a polynomial taking arbitrary values at a given set of points if we’re 
allowed to divide—so it works for R,Q, and even Z/pZ. However Lagrange Interpolation 
will not work modulo m for m composite because in general we cannot divide modulo m 
(for example, 2 has no inverse modulo 4). For instance, Theorem already tells us that 
given P(x), P(x +p) cannot be any residue modulo p?; it can only be those residues that 
are congruent to x modulo p. 


Example 9.3.4 (TST 2007/6): For a polynomial P(a) with integer coefficients, r(2i — 1) 
(for 7 = 1,2,3,...,512) is the remainder obtained when P(2i — 1) is divided by 1024. The 
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sequence 
(1) 4703) 45.057 (1023))) 


is called the remainder sequence of P(x). A remainder sequence is called complete if it is a 
permutation of (1,3,5,...,1023). Prove that there are no more than 2°° different complete 
remainder sequences. 


Solution. Step 1 
For 7 € N, let 
P(x) = [[ (e - (2k 1)). 
k=1 
(Define Po(x) = 1.) By Problem 7, any polynomial with integer coefficients can be written 
in the form Vo<j<n CiPi(x). 


Step 2 

Let a; = 5 | se |. We claim that 2% | P;(x) for all i € N and all odd x. For a prime p 
and n € Z, denote by v,(n) the exponent of the highest power of p dividing n (by convention 
Up(0) = oo). For given odd zx let f(a) be the number of values of k (0 < k <7i— 1) where 
2° |2—1-—2k. Then 


i-1 


v2(P;(2)) = $5 ve(a — 1 - 2k) = 


k=0 Qa 


Ma 


f(a) 


1 


since each k with 2° || « — 1 — 2k is counted a times in either sum. 
Since any set of 2°! consecutive even integers has one divisible by 2°, any set of 1 
consecutive even integers has at least || integers divisible by 2°. Hence f(a) > ls], 


and v2(P,(x)) > X29 | 4] as desired. 


Qe 
Note ag = 0, aj = las = 3,03 = 4,04 = "1, o5:= 8; and a; > 10 for7> 6. 


Step 3 

Next, we claim that if P(x) = No<j<n G:P;(x) has a complete remainder sequence then 
c, is odd. (co obviously needs to be odd.) We have 4 | P(4k +7) — P(i) for any integer 7; 
hence r(4ké + 1) = r(1) (mod 4) and r(4k + 3) = r(3) (mod 4) for each k. In order for the 
remainder sequence to be complete, we need r(1) # r(3) (mod 4). But noting that a; > 2 
and P;(x) = 0 (mod 4) for odd x and i > 2, we have P(3) — P(1) = c,(Pi(3) — Pi(1)) = 2a 
(mod 4). Hence c; is odd. 


Step 4 

Since for any odd x, P;(x) is divisible by 2%, if we mod out c; by 2'°~%, and delete the 
terms with P; for i > 6 (where a; > 10), we get a polynomial with the same remainder 
sequence as P;. If P(x) gives a complete remainder sequence, then cy is odd, so there are 2° 
choices for it; c; is odd, so there are at most 2° choices for c; (mod 2°) (a; = 1); for2<i<5 
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there are at. most 2'0~% choices for c; (mod 2'°-“). Hence the number of complete remainder 
sequences is at most 


29.98 TI 210-0 — 29.98 97.98 99.9? — 995, 
1=2 


Rather than ask about polynomials with integer coefficients, we could ask about polyno- 
mials with integer values, that is P such that P(n) is an integer whenever n is an integer. 
It turns out that there is a nice description of such polynomials, as the following example 
shows. 


Theorem 9.3.5: Let f(x) € C[x]. Then the following are equivalent: 
a. For every x € Z, f(x) € Z. 
b. For n + 1 consecutive integers x, where n is the degree of f, f(x) € Z. 


c. There are ao, a1,.--,@, € Z with 


f(z) =a,(*) tari(*) be tao( 8). 


Here (") is defined as 


Proof. The assertions (a) = (b) and (c) = (a) are clear ((*) are integers for all integers x 
and nonnegative integers i, by combinatorial argument). 
Suppose (b) holds. First assume that f(x) takes on integer values at 0,1,...,n. We 


inductively build the sequence do, a,,... so that the polynomial 
Mo x ab 
Pele) = on() tamil : Ste +a0(§) 
matches the value of f(x) at = 0,...,m. Define aj = f(0); once ao,...,@m have been 
defined, let 


Om+1 = f(m+1)— Pr(m t+ 1). 


Noting that cewel equals 1 at xy = m+ 1 and 0 for 0 < x < m, this gives Pr4i(x) = f(z) 
for x = 0,1,...,m +1. Now P,(zx) is a degree n polynomial that agrees with f(x) at 
x =0,1,...,n, so they must be the same polynomial. 

Now if f takes on integer values for any n + 1 consecutive values m,...,m-+n, then by 
the argument above on f(a — m), f(x) takes on integer values for all x; in particular, for 


x =0,1,...,n. Use the above argument to get the desired representation in (c). 
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The key idea here in both examples that once we know that P(x) = R(x) at some points 
X1,...,X%n, then we can write 


P(x) = R(x) + (@— a1)-+- (@ — tn)JQ(2). (9.1) 


When we’re working over Q or R, (9.1) doesn’t put a restriction on other values of P, but 
when we’re working over Z or Z/mZ, then it does. For instance, if we’re working over Z 
and 21,...,%, are integers, then we know P(x) and R(x) have to differ by a multiple of 
(x —21)-+-(4@— Zp). 


3.1 Problems 


1. [1] Suppose P is a polynomial with integer coefficients such that P(0) and P(1) are 
both odd. Show that P has no integer root. 


2. [2] (Schur) Let P be a nonconstant polynomial with integer coefficients. Prove that 
the set of primes dividing P(n) for some integer n is infinite. 


oO 
SS 


Polynomial P(x) has integer coefficients, and satisfies P(2) = 18 and P(3) = 20. 
Find all possible integer roots of P(x) = 0. 


4, [3] (Putnam 2008) Let p be prime. Let h(x) be a polynomial with integer coefficients 
such that h(0), h(1),..., h(p?—1) are distinct modulo p*. Show that h(0), h(1),...,h(p?— 
1) are distinct modulo p?. 


5. [4] (IMO 2006/5) Let P(x) be a polynomial of degree n > 1 with integer coefficients 
and let k& be a positive integer. Consider the polynomial 


k times 


Prove that there are at most n integers such that Q(t) = t. 


6. [4] (MOSP 2001) Let f be a polynomial with rational coefficients such that f(n) € Z 
for all n € Z. Prove that for any integers m,n, the number 


f(m) = Flr) 


m— mn 


Icm|{1, 2: te , deg(f)] : 
is an integer. 


7. [2] (Helpful for the next few problems) Let f(x) € Ri], and let po, pi,... be a sequence 
of polynomials whose leading coefficients ug, u1,... are units (ie. invertible), and 
deg(p;) = 7. Show that f can be uniquely written in the form 


f(x) = anpn(a) +... + aypi (x) + agpo(2). 


In particular, this is true for p;(x) = xt = a(x — 1)---(w —i+1). 
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A 


10. 


Ls 


12; 


. [2.5] How many polynomials of degree at most 5 with integer coefficients satisfy 0 < 


P(x) < 120 for x =0,1,2,3,4,5? 


. [4] (USAMO 1995/4) Suppose qo, qi, q2,.-. is an infinite sequence of integers satisfying 


the following two conditions: 
(a) m—n divides qm — dn form >n > 0, 
(b) there is a polynomial P such that |¢,| < P(n) for all n. 
Prove that there is a polynomial Q such that gq, = Q(n) for each n. 


[5] (TST 2008/9) Let n be a positive integer. Given an integer coefficient polynomial 
f(x) define its signature modulo n to be the ordered sequence f(1),...,f(m) modulo 
n. Of the n” such n-term sequences of integers modulo n, how many are the signature 
of some polynomial f(z) if n is a positive integer not divisible by the cube of a prime? 
(Easier variant: if n is not divisible hy the square of a prime) 


[5] (variant of TST 2005/3) For a positive integer n, let S' denote the set of polynomials 
P(a) of degree n with positive integer coefficients not exceeding n!. A polynomial P(x) 
in set S is called fine if for any positive integer k, the sequence P(1), P(2), P(3),... 
contains infinitely many integers relatively prime to k. Prove that the proportion of 


fine polynomials is at most 
1 
a) 
prime p<n pP 
(Original statement: Prove that between 71% and 75% of the polynomials in the set 
S are fine.) 
[5] Suppose f(a) is a polynomial of degree d taking integer values such that 
m—n| f(m) — f(n) 
for all pairs of integers (m,n) satisfying 0 < m,n < d. Is it necessarily true that 
m—n| f(m) — f(n) 


for all pairs of integers (m,n)? 


Polynomials in Number Theory 


We give an interesting application of polynomials to number theory. Recall the following. 


Theorem 9.4.1 (Victa’s Theorem): Let r1,...,7 be the roots of 7), a;2", and let 


sj = S- Ty Tig. 


1<i1 <...<ijy<n 


Then s; = (—1)/)=. 


an 
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Theorem 9.4.2 (Wolstenholme): Prove that a = ey (mod p?) for prime p > 5. 


Proof. By Fermat’s Little Theorem, x?~' = 1 (mod p). Thus in Z/pZ, 


ge t_1= The —i) (mod p). (9:2) 


i=1 
Write (« — 1)! = 9?5) ajax’. Then matching coefficients on both sides of (9.2) gives 
a;=0 (mod p) for alll <i<p-—1. (9.3) 


Since p > 5, letting x = p gives 
p-2 
(p— 1)! = (a —1)24 = pt + (x oa +ap+(p—1)! 
i=2 


since (—1)(—2)---(—p+ 1) = (—1)?-'(p—1)! = (p—1)!. Subtracting (p — 1)! on both sides, 


Using (9.3), p> | ap’ for 2<i<p-—1. Hence, since p > 5, p® | p?-! + P73 ap’. Since p® 
divides the LHS, p® | ayp and p? | a,. Now p® | (kp)?! + (3 a;(kp)') as well and we get 


(kp — 1)P= = (@ — 1)? |e=pe 


= (kp)P-* + (S a + akp + (p — 1)! 
=(p—1)! (mod p’). (9.4) 


Now, 


aps il(pt) (pi — 1)P*] 
TT} [(pt) (pi — 1)2=4) 

a i [p(i+a—b)—1]24 

(Oo iW (pi — 1)2=4 | 


By (9.4), [p(¢ +a — 6) — 1]2" = (pi — 1)2 (mod p?). Hence (9.5) becomes ($) modulo p’, 


as needed. 


; (9.5) 
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Theorem 9.4.3 (Lucas’s Theorem): Suppose that the base p expansions of m and n are 


Then 


Proof. We have the identity 
Dee YS Ve Es 8 ee 
Now take both sides modulo p and use the fact that (1+X)?" = 1+ X?" (mod p) to obtain 
(14+ X)™= (14 XP*)™.~.. (14 X?)™(14+ XY™ (mod p). 


Now match the coefficients of X” on each side. The coefficient on the left hand side is 7) 
For the right hand side, note the only way to get the term X” is by choosing X "59P” from the 
term (1+ X?’)'™/, simply by uniqueness of base p representation; the coefficient of X"/”’ is 
Co) Hence the coefficient of X” on the right hand side is ee vee Ce) ae Equating the 


n n1/ \no 


coefficients gives the desired result. 


Corollary 9.4.4: Let n be a positive integer, and let B(n) be the number of 1’s in the 
binary expansion of n. Then the number of odd entries in the nth row of Pascal’s triangle 
ig 27), 
4.1 Problems 

1. [3] Prove that for prime p > 5, 


Pie! (1 | ; pte ). 


p-l 


2. [3.5] (APMO 2006/3) Prove that for prime p > 5, (°°) = p (mod p’). 


3. [3.5] (ISL 2005/N3) Let a,b,c, d,e, f be positive integers. Suppose that the sum S = 
a+b+c+d+e+f divides both abc + def and ab+ bc+ca—de—ef — fd. Prove 
that S is composite. 


4. [5] (China TST 2009/3) Prove that for any odd prime p, the number of positive integers 
n satisfying p | n!+ 1 is less than or equal to cp3, where c is a constant independent 
of p.? 


?Hint: A polynomial of degree n over a field (such as Z/pZ) can have at most n zeros. 
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5. [4-5] (TST 2002/2) Let p be a prime number greater than 5. For any positive integer 


x, define 
p-1 t 


Prove that for all positive integers x and y the numerator of f,(x) — f,(y), when written 
in lowest terms, is divisible by p’. 


(MOSP 2007/2.2) Let d be a positive integer. Integers t,,t2,...,tq and real numbers 
@1,...,@q are given such that 

ait} + dot, +--+ + aat?, 
is an integer for all integers 7 with 0 < 7 < d. Prove that 


a,t? + agt? 5 ie agt4 


is also an integer. 


5 Resultant 


Definition 9.5.1: Let R bea UFD, and let A(x) = anxv™+---+a9 and B(x) = b,x"+---+bo 
be in R{a]. Define 


[0 dea 0 0} 
0 “Sg: “i 88 0 
0 0 ao am 
M(A,B) = bo --- b OO O O 
0 0 0 
0 O ‘ ‘ 4 an 0) 
lo Os GY. sOp.¢ seak: fam: a 


where the first n rows contain the a; and the last m rows contain the b;. The resultant of 
A and B is 
Res(A, B) = det(M(A, B)). 


Note that Res(A, B) is homogeneous of degree n in ao,...,@m and homogeneous of de- 
gree m in bo,...,b,. The main use of the resultant is that considering it as a function of 
o,-++;4m,b0,..-,6n, it tells us when A and B have a common factor. 


For homogeneous polynomails, we write A(z, y) = GmX™+---+aoY¥™ and B(zx,y) = 
b,X" +---+ bY” and define the resultant the same way. 


Proposition 9.5.2: 


1. Res(A, B) = 0 if and only if A and B have a common factor, i.e. have a common zero 


in K. 
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2. If dmbn #0, and A = ay Ti \(X — aj), B= bp Th1(X — Bx), then 
Res(A, B) = ab" T] TT (ay — 8). 


3. There exist polynomials F’,G € Rlao,..., bn|[X] such that 


FA+GB = Res(A, B). 


Proof. 


1. A and B have a common factor in KX] if and only if there exist polynomials nonzero 
polynomials C(x) = ¢p—.X™ 1 +++++ cg and D(x) = dy_1X""1 +--+ +dy is K[X] 
such that AC = BD and degC < m-—1 and deg(D) < n—1. Multiplying out 
AC = BD and treating it as a system of linear equations in the c; and d;, we get 
that the determinant of the coefficient matrix is + Res(A, B). Thus there is a nonzero 
solution if and only if Res(A, B) = 0. 


2. First assume a, = b, = 1. We considering the coefficients a,,b, and Res(A, B) as 
functions of the roots, aj,...,Q@m,(1,...,8n. By Vieta’s formulas, a,,_, is homoge- 
neous of degree k in aj,...,Q@m and b,_; is homogeneous of degree k in (1,...,bn. The 
“big formula” for the determinant says that Res(A, B) is the sum of entries of the form 
fe Myo(k), Where M = M(A, B). 


e For 1 <k <n, Myz.o(x) is either a polynomial of degree a(k) — k in the a; or zero, 
and 


e For l <k <m, Mniko(n+k) is a polynomial of degree a(n +k) —k in the 6;. 
Hence if 17" Mz,o(x) #0, then it has degree 
m m+n n m 


Yo (a(R) =) + So(a(n +b) =k) = "5 ofmn-+ n) — Sok = So k= mn 


= k=1 k=1 k=1 


Now when a; = (£;, then by part 1, Res(A, B) = 0. Hence a; — 6; divides Res(A, B). 
By comparing degrees, we must have 


Res(A, B) = C Il Il (aie Bp): 


j=l k=l 


To compute C, note that there is exactly one term in the determinant that gives 
a” = (a,---Qm)", so C = 1. By scaling, the desired result holds for am, by 4 0. 
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3. We have 
oe 
Xmrn-i : 
; A 
M(A, B) : xXx7!1B 
1 
B 


Let C denote the cofactor matrix of M(A, B). Multiplying by C7 on both sides gives 


X14 
Xmrn-i 
r| <A 
Res(A, B) : =C xm-lp 
if 
B 


Let the coefficients of F,G be given by the bottom row of C’. Then multiplying out 
the matrices and looking at the bottommost entry gives the desired conclusion. 
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Chapter 10 


Field Theory 


sea An extension of a field F is a field containing F’. 
1. A number field is a subfield of C. 
2. A finite field has finitely many element. 


3. A function field is an extension of C(t). 


1 Algebraic elements 


Definition 10.1.1: Let L be an extension of K and a be an element of L. a is algebraic 
over Kx if it is the zero of a polynomial in A [z], and transcendental otherwise. 


Note a is transcendental if and only if the substitution homomorphism y : K[z] — L is 
injective. 
2 Degree of a field extension 


Definition 10.2.1: The degree [L : K] is the dimension of L as a K-vector space. 


3 Fundamental theorem of algebra 
Theorem 10.3.1: C is algebraically closed. 


In other words, every nonconstant polynomial with coefficients in C has a zero. Equiva- 
lently, every nonconstant polynomial with coefficients in C splits completely. 


Proof. We first show that all polynomials with real coefficients are reducible over the complex 
numbers, by induction on the highest power of 2 dividing the degree. For odd degree, the 
the statement follows since the polynomial has different signs near at --oo. Now assuming 
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the induction hypothesis, suppose deg(f) = 2*m where k is odd. Choose a splitting field L 
of f, and write P(x) = (a —1r,)---(a —T,). Consider the polynomial 


P(x)= JJ (@-m-—1r; -trir;). 
i<i<j<n 

Its degree is mae) = 2'-lm(n — 1). Since its coefficients are symmetric polynomials in 
the r;, by hypothesis it has a complex zero, i.e. rj + 7; + trir; is real for some 7,7. Since 
this is true for infinitely many values of t, we must have that r; + 7; + tr;r; is real for all t 
some i,j. This means 7; +7; and r;r; are both real. Then r;,r; are roots of the quadratic 

x? —(r; +r;)v + rir; so they are complex roots of P(x). This concludes the induction. 
Next for an arbitrary polynomial P(x), consider the real polynomial P(x) P(x). (We take 
the conjugate of the coefficients, not x.) By the above, it factors entirely into linear factors. 
P(x) divides P(x)P(x), so it splits as well. 


4 Constructions 
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Finite fields 


1 Finite fields 


A finite field is a vector space over F,, for some prime p, so has order q = p". The (unique) 


field of order g is denoted by F,. 


Proposition 11.1.1: 


it 


vA 


6. 


. The elements in a field of order g are roots of x? — x = 0 (everything is modulo p). 


F* is a cyclic group of order q — 1. 


q 


. There exists a unique field of order q up to isomorphism. 


. A field of order p” contains a subfield of order p* iff k | r. (Note this is a relation 


between the exponents, not the orders.) 


. The irreducible factors of «4 — x = 0 over F,, are the irreducible polynomials g € F,,[X] 


whose degrees divide r. 


For every r there is an irreducible polynomial of degree r over F,. 


Proof. 1. The multiplicative group F, of nonzero elements has order g — 1. The order of 


any element divides g— 1 so a?~* = 1 for any a € Fy. 


2. By the Structure Theorem for Abelian Groups, F? is a direct product of cyclic sub- 


groups of orders d, |---| d,, and the group has exponent d;. Since 7?" — 1 = 0 has at 
most d; roots, k = 1 and dj =q-1. 


. Existence: Take a field extension where x? — x splits completely. If a,@ are roots of 


xi —x =0 then (a+ 8)1=a+ B. Since —1 is a root, —a is a root. The roots form a 
field. 
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Uniqueness: Suppose K, A’ have order qg. Let a be a generator of K™*; then kK = F(a). 
The irreducible polynomial f € K |X] with root a divides 4—x. x1—x splits completely 
in both K, K', so f has a root a’ € Kk’. Then F(a) = F[z|/(f) = F(a’) = K’. 


CF, = > k|r: Multiplicative property of the degree. 


oe e 
Pye CF» =k |r: p’—1| p*—1. Cyclic F% contains a cyclic group of order p*. 


Including 0, they are the roots of x?" — x = 0 and thus form a field by 3a. 


5. = >: Multiplicative property. 


<=: Let 8 be a root of g. Ifk | r, by 4, F, contains a subfield isomorphic to F(8). g 
has a root in F, so divides x? — z. 


6. Fy (¢ =p") has degree r over F, and has a cyclic multiplicative group generated by an 
element a. F,,(a@) has degree r over F,,. 


To compute in F,, take a root 3 of the irreducible factor of #?—z of degree r; (1, 8,..., 6"~') 
is a basis. 
Let W,(d) be the number of irreducible monic polynomials of degree d in F,. Then by 2, 
p= 3 dW, (d) 
d\n 


By Mobius inversion, 


Theorem 11.1.2: The Galois group G(F,-/F,) is cyclic of order r generated by the Frobe- 
nius automorphism 


oe) = 2%, 
Definition 11.1.3: Let L be a field extension of AK. An element a € K such that L = K(a) 
is a primitive element for the extension. 


Theorem 11.1.4 (Primitive element theorem): Every finite extension of a field K contains 
a primitive element. 


Proof. Need a general proof! 


2 Quadratic reciprocity via finite fields 


We work in F,. Since (2) = pt, we will explicitly find an element a such that a? = +p. 
Then (2) als 
qd 
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Let ¢, be a primitive qth root of unity and consider the Gauss sum 


All inverses below are modulo qg. We calculate a4! in two different ways. 


Step 1: We calculate 


7 if 


i") c= (2) is group homomorphism 
| q 


£(E) 7 


(When s = j the terms are 0.) 


1. When s # 0, noting 1 — sj~! ranges over F, — {1} when s ranges over F, — {0}, we 


have 
(ae) FOE 


j=1 j=l 
=F (2) 
j=l q 


The last step comes from noting that there are as many quadratic residues as non- 
residues. 


2. When s = 0, we have 


(52) (2) Eanecavenn 


j=l q 


Hence the sum (11.1) equals 
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and 
Pt = (2) = (1g = (eg = (ye (2), 


| 
= (=) : since p=1 (mod 2) 


so aP-l= (2). 
Equating the results of stpes 1 and 2 gives the result. 


3 Chevalley-Warning 


Lemma 11.3.1: 


acF, 


Proof. Ifq—1|n then a” = 1 for all a € Fy, so the sum is 0. 
If q—1 { n then (since FS = Z/(q — 1)Z) there exists G6 € FY such that 8” F 1. 
Multiplication by @ is a bijection on F, so 


¥ a= ¥ (ap\"6" Fo a”. 


ackg ack ackg 


Thus the sum must be 0. 


Theorem 11.3.2 (Chevalley-Warning): Let fi,... fx € Fa[X1,..., Xn] be polynomials with 


do deg( fj) <n. 


j=l 
Let Vii fiisknts Fe = Pelt aa): Fl Digeeg ey) — 0 for ally}. Phen 
Vil fis rade) | = Cand =p): 


In particular, there is a nontrivial point in V(fi,..., fx). 


'This result says that finite fields are C, fields. 
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Proof. We engineer a polynomial that is 1 when x € V(fi,..., f,) and 0 otherwise: 


Pins a) = Ta — fj(X1,...,Xn)*"). 


j=l 


Indeed, 


sO 


Lira ha 


r=? $ Git 


and multiplying gives the desired conclusion. 


Hence we can count the number of points in V(fi,..., fn) as follows: 
Vilivescata)| = Sy . . P@iysces ain): (11.2) 
(21,.-,0n)EFF 
Note : 
deg P = (q— 1) }/ deg(fj) < (q—1)n 
j=l 
so each term in P(X1,..., Xp) is in the form 


al a 
Ker KO 


with a1 +---+@, < (¢—1)n; this means a; < q—1 for some j. Then Vy cp, 27 --- 25" =0 


(mod p) by Lemma }11.3.1] so (after summing over the other x;) this term contributes 0 


modulo p to the sum in|11.2} Summing over all terms gives the result. 


Theorem 11.3.3 (Erdés-Ginzburg-Ziv): From any set of 2n —1 integers there exist n whose 
sum is divisible by n. 


Proof. We first prove the result for n = p prime. 
Let S = {aj,...,@2p-1}. Associate a subset T to any (2p — 1)-tuple (a,...,22p,-1) € F, 
where x, 4 0 iff a, € T. We will translate the condition on T into equations in the xp. 
Consider 


fi(z) = aR tee t+ tee 


=i = 
fo(x) = a,x +++++ Gop-1 255-1 


in F,. The first equation is 0 iff |Z| = 0 (mod p), while the second is 0 iff },-pa = 0 
(mod p). We have deg f; + deg fz = 2(p—1) < 2p —1 so by Chevalley-Warning the number 
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of solutions is a multiple of p. Since (0,...,0) is a solution, there must be another one. That 
solution must correspond to a subset of size p and hence satisfies the required conditions. 

Next suppose that the theorem holds for m,n relatively prime; we show it holds for mn. 
Given r > 2m — 1 elements, by assumption there will be a subset JT’ of m elements whose 
sum is divisible by m. We start with a set S of 2mn-— 1 integers; continue to pick subsets of 
size m as described. After k steps we will have m(2n — k) — 1 elements, so we will be able 
to carry out 2n — 1 steps and get 


ee wes plat 
Let the sums of elements of these sets be t1,...,tan—1. By the hypothesis for n, we can find 
a subset of n elements, say t;,,...,¢;,, with sum divisible by n. Then 

T;, U+++UT;, 


has mn elements and sum divisible both by m and n, hence by mn. 


Problems 


1. If k is infinite and P is a nonzero polynomial in k[x,,...,2,], then there exist ti,...,tn 
such that Pins .+5t,) 20. 


Solution: Induct on n. For n = 1, the polynomial can have at most n roots so the 
assertion holds. Suppose it’s proved for n—1 and P € kijty,..., tp]. Since k[t,,...,tr-1] 
has infinitely many elements, thinking of P as a polynomial of t,, with coefficients in 
k{t1,...,tn-i], some element in k[ti,...,tn-1] is not a zero of P. Set t, to be this 
element to get a nonzero element of k/ti,...,tn-i]. By the induction hypothesis we 
can find values for t),...,t,—1 so that the polynomial does not evaluate to 0; substitute 
these values into the polynomial for t, to get ty. 


4 Problems 


1. (Harvard Quals, 2013/2.4) 


(a) Let K/F be a field extension of degree 2n +1 generated by t. Prove that for every 
c € K there exists a unique rational function f € FT] such that deg(f) <n and 
c= f(t). (The degree of a rational function f is the smallest d such that f = 6 
for polynomials P, Q each of degree at most d.) 

(b) Deduce that if [AK : F'] = 3 then PGL»2(F’) acts simply transitively by fractional 
linear transformations on K\F (the complement of F in Kk). If |F| = q < ov, 
compute PGL»(F’) directly, and verify that it equals || — |F'. 
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Galois Theory 


1 Galois groups and Galois extensions 


Definition 12.1.1: The Galois group of an extension L/K, denoted 
Gal(L/K) = G(L/K) 
is the group of field automorphisms of L fixing K. 


Also talk about stuff from an embedding point of view. 
Definition 12.1.2: A Galois extension of K is a normal, separable extension. 


Theorem 12.1.3: Suppose L/K is a finite field extension. L/K is a Galois extension if and 
only if 
IG(L/K)| = [: K}. 


2 Fixed fields 


Definition 12.2.1: Let H be a group of automorphisms of a field K. The fixed field of 
H, K", is the set of elements of K fixed by every group element. 


K" = {a € K :o(a) =a for every o € H}. 


The following relationship between H and K¥” will be instrumental in proving the Fun- 
damental Theorem of Galois Theory. 


Theorem 12.2.2 (Fixed field theorem): 1. [K : K"] = |H|: The degree of K over kK” 
is the order of the group. 


2. H=G(K/K"): K is a Galois extension of K” with Galois group H. 


Proof. 
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3 Splitting fields 

Definition 12.3.1: A splitting field of f € A[X] over K is an extension L/K such that 
1. f splits completely in K: f = (X —ay)---(X — an), ax € K. 
2) d= K Gig n2s, Oy). 


A splitting field is a finite extension, and every finite extension is contained in a splitting 
field. 

The following shows that the splitting property of a splitting field is in a sense independent 
of the polynomial chosen. This will help us relate splitting fields to Galois extensions. 


Definition 12.3.2: A field extension L/K is normal if every polynomial g(X) € K[X] 
with one root in K splits completely in Kk. 


Theorem 12.3.3 (Splitting theorem): The normal extensions L/K are exactly the splitting 
fields of polynomials in K[X]. 


Move this to before Galois extensions so “normal” extension is defined. 


Proof. Suppose g(X) has the root 8 € K. Then pi(ay,...,Q@n) = 6 for some p; € K[X,..., Xn]. 
Let p,,..., px be the orbit of p; under the symmetric group. Then Tt. (X—pi(ar, 1.+;Qn)) € 
K(X] by symmetry so it is divisible by g(X), the irreducible polynomial of (. 


The order of G = G(L/K) divides [L : K], since 
[ok |= [bere 2k). 
IG| 


Theorem 12.3.4 (Characteristic properties of Galois extensions): For a finite extension 
L/K, the following are equivalent. Merge with |G| = [L: kK]? 


1. L/K is a Galois extension. 
2, LEV) =, 
3. L is a splitting field over K. 


Proof. (1) <> (2): By the Fixed Field Theorem, |G| = [L : L°]. 

(1) <= (8): Let 7 be a primitive element for L with irreducible polynomial f. Let 
V1,---;Yr be the roots of f in L. There is a unique K-automorphism o; sending 7, > 7; 
for each 7 and these make up the group G(L/K’). Thus the order of G(L/K) is equal to the 
number of conjugates of 7; in L. Hence we get the following chain of equivalences. 


1. L/K Galois 
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2y (G] = (he ES 
3. f splits completely in kK 


4. K is asplitting field. 


Proposition 12.3.5 (Properties of the Galois group): If L/K is a Galois extension, and 
g € K|X] splits completely in L with roots 61,...,8,, then 


1. G operates on the set of roots 6;. 
2. G operates faithfully if Z is a splitting field of g over kK. 
3. G operates transitively if g is irreducible over K. 


4. If L is the splitting field of irreducible g, then G embeds as a transitive subgroup of 
ope 


4 Fundamental theorem of Galois theory 


Theorem 12.4.1 (Fundamental theorem of Galois theory): Let L/K be a finite Galois 
extension and let G = G(L/K). Then there is a bijection between subgroups of G and 
intermediate fields, defined by 


Hw K® 
G(L/K') aK, 


Moreover, letting K’ = K”, K’/K is a Galois extension iff H is a normal subgroup of G. If 
so, then G(K'/K) = G/H. [Diagram here.] 


Proof. Let 7 be a primitive element for K’'/K and g the irreducible polynomial for y, over 
K. Let the roots of g in K be 91,...,7,. For 0 € G, o(%1) = 1, the stabilizer of 4; is 
oHo'. Thus cHo! = H if and only if y, € K’ = K". H is normal iff all y; € L iff K’/K 
Galois. Restricting o to L gives a homomorphism y : G > G(A’'/K) with kernel H. 


Definition 12.4.2: A normal basis of a Galois extension L/K is a basis in the form 
{o(8) :0 € G(L/K)} 
for some 6 € L. 


Theorem 12.4.3 (Normal basis theorem): Every Galois extension has a normal basis. 
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Proof. Write G(L/K) = {o1,..., 0m}. Consider two cases. 


Case 1: K is infinite. We show the following. 


Lemma 12.4.4: If f © K[X1,...,Xm] is such that f(oia,...,%ma) = 0 for all a € EF, then 
f=U. 


Proof. Let X = (Xi,...,Xm)7; we write f(X) for f(Xi,...,Xm). Let Y = (%1,...,¥n), 
and define 
O1Q, ‘+1: O1QAm 
gY)=9> 2 3 |X. 
Cri 2% Onn, 


Then by assumption g(a) = 0 for each a € K. Since K is infinite g is the zero polynomial. 


O1Q, +: 01m 
Add proof. Note the matrix ( ) is invertible (this is corollary of indep. of char 


OmQ1 + FmAm 


- ADD). Hence f must also be the zero polynomial. 


Let A be the matrix with X; in entry (i,7) if o;00; = o,. Let 
F(X, inate ,Xm) — det (A). 


Note f(1,0,...,0) is the determinant of a permutation matrix (since given any g,h ina 
group, there is exactly one element k and one element / so that lg = gk = h), so equals +1. 
This shows f is not the zero polynomial. Therefore, by Lemma [12.4.4] there exists a € K 
such that f(o1a,...,%ma) #0. Suppose that a1,...,@m € AK and 


S- Apn0%(a) = 0. 
k=1 


Then - 
S- Ap0;0%(a) = 0 
k=1 


for all 2. Think of this as a system in the a;. The matrix corresponding to this system is 
det(A) 4 0, so all the a; = 0. This shows that o;,(a) are linearly independent. 


Case 2: K is a finite field. Then the Galois group is cyclic (Theorem ??); say G = (co). By 
independence of characters, I,a,...,0"~' are linearly independent so the minimal polynomial 
of o is X" — 1. Consider L as a K[o| = K[X]/(X”" — 1)-module. By the structure theorem 
for modules, we have 

L= K|X]/(p1) @--- ® K[X]/(Pm) 


for some polynomials (p,) dividing X” — 1 with p, |---| pm. Since the minimal polynomial 
of o is X" —1, we must have p, = X” —1. But [L: K]} =nsom=1 and L& K[X].' This 
means there exists an element a such that a,ca,...,0" 'a generate L over K. 


‘Compare this to the proof of primitive elements in finite fields. 
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5 Cubic and quartic equations 


6 Quintic equations 


Theorem 12.6.1 (Quintic impossibility theorem): 


7 Inverse limits and profinite groups 


To study infinite Galois groups, it is fruitful to view them as the “limit” of finite Galois 
groups. Thus we first introduce the notion of an inverse limit. This gives infinite Galois 
groups the structure of a profinite group, and its topology becomes important. 


7.1 Limits 


We will eventually care about limits not just for abelian groups but also topological groups, 
modules, and so forth. To take care of all this in one fell swoop, we introduce a bit of 
abstraction, via category theory. 


Definition 12.7.1: A category C is a collection of objects and morphisms (or maps). 
Each morphism y has a source and target object A and B; let Home(A, B) be the set of 
morphisms from A to B. There is a composition law 


Home(A, B) x Home(B, C) + Home(A, C) 
(a, 8) 4 Boa 


satisfying the following: 


1. For each object B there exists an identity morphism 1s € Home(B, B) such that 
lpoa=a for any a € Home(A, B) and Golg = 6 for any 6 € Home(B,C). 


2. Composition is associative: 
yo (Boa) =(yoB)oa 
for any a € Home(A, B), 6 € Home(B,C), and y € Home(C, D). 


A morphism a € Home(A, B) is an isomorphism if there exists 6 € Home(B, A) such 
that Gow~=1, andaoB=1p,. 


Example 12.7.2: 2 We can often think of the objects as sets, possibly endowed with extra 
structure, and morphisms as maps between them preserving the structure. 


?We have to be careful about the word “sets”... See for all the stuff we’re sweeping under the 
rug. 
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1. ((Sets)) Objects: sets. Morphisms: functions. 
2. ((Rings)) Objects: rings. Morphisms: ring homomorphisms. 
3. ((R-mod)), where R is a ring. Objects: R-modules. Morphisms: ring homomorphisms. 
4. ((Groups)) Objects: groups. Morphisms: group homomorphisms. 
(a) ((Ab Groups)) Objects: abelian groups. Morphisms: group homomorphisms. 


5. ((Top)) Objects: topological spaces. Morphisms: Continuous maps. 


6. ((Top Groups)) Objects: topological groups.® 


phisms. 


Morphisms: Continuous homomor- 


However, objects in categories do not have to be sets. For instance, any poset S can be 
turned into a category, by letting the elements be the objects, and declaring a morphism yi 
whenever 2,7 € S andi = 7. 


Definition 12.7.3: Let C be a category. Let {Aj} and {y‘} be a set of objects in C and 
homomorphisms between them.* We say that ({A,}, {y;}) form a inverse (or projective) 
system if the following two conditions are satisfied. 


1. For every A; # A; there exists A; such that there are morphisms y? : A, — A; and 
yF : A; = Ag 


2. For every pair of maps yi, : Aj + A, and yi : Aj — Ax there exists a map” ai, : Aj > 
A; such that yj, o yi = 0 pe. 


In our applications there will only ever be one map A; — A;, so the second condition is 
empty. 
Finally, we define the notion of inverse limit. 


3Groups endowed with a topology such that multiplication is continuous on G x G and taking the inverse 
is continuous. 

4We’re allowed to have different maps between A; and Aj; however in our examples we usually won’t. 

5called the equalizer 
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Definition 12.7.4: Let {A;} and {y‘} be a set of objects in C and homomorphisms between 
them. Suppose that {yi} is closed under composition.® We say a sequence of maps a; :. A > 
A; is compatible if for every map yi : A; — A; in our set of maps, 


a xt 
a; = Pj CO Ai. 


The inverse limit 


is the unique object in C (up to isomorphism) with compatible maps a;, satisfying the 
following universal mapping property (UMP): For every object B with compatible maps §;, 
there isa map y: B > A such that 8; = a; 0 for every 7, i.e. the following commutes: 


(12.1) 


cone 


This is a very abstract definition, but we will be able to construct A explicitly in the 
cases we care about. Uniqueness follows from the UMP; the inverse limit exists for all inverse 
systems if and only if C has products and equalizers. (See 18.705 notes.) 


Theorem 12.7.5: inv-timit-comp-seq Suppose C is ((Sets)), ((Groups)), ((R-mod)), or ((R-alg)). 
If ({ Ai}, {y}) is an inverse system, then lim A; can be realized as the set of all sequences 


a;): a; € A;, vi(a;) =a; for all yi}, 
j j j 


with the natural module or algebra structure, as applicable. 


Proof. Just verify that the UMP is satisfied. 


Example 12.7.6: The ring of p-adic integers 
Ly = lim Z/p"Z 


is defined the inverse limit of ({Z/p"Z}nez, yr,) where yr, : Z/p"Z > Z/p™Z, for n > m, 
are the natural projection maps. An element of Z, can be thought of as a number modulo 
arbitrarily high powers of p. We have an injective map Z — Z,, but there are elements of 
Z, not in Z (think this through). 

We will explore p-adics in depth in Chapter [20} 


SEquivalently, the A; and yi are indexed by a category, i.e. there is a functor from a small category into 
C. 
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Example 12.7.7: na: Define 
L= lim Z/ nZ, 


as the inverse limit of ({Z/nZ}nez, pn,), where the maps vy”, : Z/nZ > Z/mZ with m | n 
are given by projection. 


In the next section we will interpret these limits not just as limit of groups, but of 
topological groups. 


7.2  Profinite groups 


profinite We assume knowledge of topology (continuous maps, compactness, separation axioms, 
connectedness, product topology, Tychonoff’s theorem). 


Definition 12.7.8: A profinite group is a inverse limit lim. , Gi Of finite discrete topo- 
logical groups Gi. 
Suppose that y; : G — G; are all surjective. The order #G of G is the formal product 


er vp(IGil) | 
Pp 


In other words it is the “least common multiple” of the |G]. 


We know that if we only consider the G; as groups, then by Theorem {12.7.5} the inverse 
limit can be described as the the set of tuples (g;)ie7 such that y5(g;) = g, for every transition 
map ~; : G; + Gj. But we need to show that the inverse limit is well-defined when the G; 
are topological groups. We give a topology on the inverse limit of groups, him, ‘ G;, so that it 
satisfies the UMP for the inverse limit of topological groups. (In the category of topological 
groups, homomorphisms must be continuous. ) 


Proposition 12.7.9: Give 
Ges lim G; 
groups 

the following topology: Equip each finite group G; with the discrete topology and J],<; G; 
with the product topology. Then G = him G; is the closed subspace of ]],<; Gi; of compat- 
ible sequences; give it the subspace topology. 

Then 

G= lim Gi. 
top. group 

Proof. We show that G satisfies the UMP. 

First, note that the maps y; : G — G; are continuous. Indeed for any open U; € Gi, 
letting 7; be the projection map [],<; Gi, 7; (Uj) is open. Hence y;'(U;) = 77 '(Ui) NG is 
open in G. 
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Now let H be a topological group with compatible maps 6; : H — G;. In order for [12.1] 
to commute, we must define 
p(h) = (Bi(h))i- 
This is a continuous map §; : H — J]; G; because it is the product of continuous maps; it is 
also a homomorphism. Its image is in G C J]; G; because the 6; are compatible. Since G is 
given the subspace topology, @ is continuous, as desired. 


The following characterizes the topology of profinite groups. 


Proposition 12.7.10: A topological group G is profinite iff it is compact, Hausdorff, and 
totally disconnected. 


Proof. First suppose G = lim. f G; is profinite. 


1. TI;ep Gi is compact by Tychonoff’s Theorem (an arbitrary product of compact spaces 
is compact) so the closed subspace G is compact. 


2. Given g = (g;) and h = (h;), suppose g; # h;. Partition G; into two sets A and B 
containing g; and h;, respectively. Then a; '(g;) and a;'(g;) are disjoint clopen (open 
and closed) sets containing g and h, respectively. This shows that G' is Hausdorff and 
totally disconnected. 


The converse is left as an exercise (we won’t need it). 
Profinite groups can be constructed from arbitrary abelian groups as follows. 
Definition 12.7.11: Let G be a group. Define the profinite completion of G to be 


G= G/N 


SIE 


N normal of finite index 


with the natural projection maps. 


Example 12.7.12: This agrees with our definition of Zin Example }12.7.7| In the profinite 
topology of Z, the subsets nZ form a neighborhood base of 0. 


8 Infinite Galois theory 


Let ./K be an infinite Galois extension. We equip G(Q/K) with a topology by interpreting 
it as a profinite group, as follows. 


Proposition 12.8.1: 
G(Q/K) = lim G(L/K), 


L/K finite 


where the limit is with respect to the quotient maps G(M/K) > G(L/K). 
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Proof. Identify the right side with compatible elements of [],/,% G(L/K) and send ao € 
G(Q/K) to (o|z)z. This is a bijection because any element of 2 is in a finite extension over 
K, so specifying a map 2 — 2 is the same as specifying a compatible sequence of maps 
L — L for every finite Galois extension. 


Now we give G(Q./K) the profinite topology. Equivalently, it is the topology such that a 
neighborhood base of 1 is 


G(S) = {¢ € G(Q/K):os=s foralls eS},  S finite. 


We next show surjectivity of the quotient map. 


Proposition 12.8.2: Every homomorphism o : L > ( extends to a homomorphism 2 > 2. 
If L/K is Galois, then the restriction map G(Q/K) — G(L/K) is surjective. 


Proof. Use Zorn’s lemma, as follows. Define a poset P whose elements are pairs (M, yy), 
where M is a field with L C M C Q and yy is a homomorphism M > Q. Introduce a 
partial ordering by saying 
(M, pm) S (N, pw) 
if MCN and yyl|lu = vm. If (Mi, yu,) is a chain (totally ordered subset), then it has a 
maximal element in P, namely, 
(Uae) 


where ¢ is defined as p(x) = y;(x) if  € M;. Thus by Zorn’s lemma P has a maximal 
element (M, ym). 
For any element a € Q, by finite Galois theory (ref) we can extend (M, pa) to (M(a), yaa): 
By maximiality of M, M = M(a),ie. M=Q. 
The second part follows directly. 


The following is the analogue of the fixed field theorem. Note that topology now plays a 
role. 


Theorem 12.8.3 (Fixed field theorem, infinite extensions): Suppose 2/K is Galois and 
G = G(Q/K). 

1. G(Q/L) is closed, and N¢°/) = L. 

2. For every subgroup H C G, G(Q/0#) = H. 


Proof. 1. The sets G(S) are open of finite index, hence closed. Hence G(Q./L) = Nainite scr GCS) 
is closed. 


For the second part, note that for every finite Galois extension M/L, we know 
OE OEN Me Bi 
Since this is true for every such W/L, and G(Q/L) - G(M/L) is surjective, the result 


follows. 
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2. It is clear that G(Q/Q") D H. By part 1, G(Q/Q*”) is closed, so it contains H. 


FINISH... 


Theorem 12.8.4 (Fundamental theorem of infinite Galois theory): There is a bijection 


between closed subgroups of G and intermediate fields L with K CL CQ. 


HH Ot 
G(Q/L) aL. 


We have the following. 


1. This map is inclusion-reversing. 


2. H is open if and only if [Q” : K] < oo. Then [G: H] = [07 : kK). 


3. cHo! corresponds to oM, so H is normal if and only if Q%/K is Galois. 


G(Q#/K) = G/H. 
Note given closed, open iff of finite index. 
Proofs! + Some more silly properties. 


Example 12.8.5: We have 


G(F,/F,) = lim G(Q/F,) = lim Z/nZ == Z. 


Example 12.8.6: Let 
Q(Coo) = QA{6n : 2 € NJ). 
Then ee 
G(Q(Coc)/Q) = lim G(Q(¢n)/Q) = lim(Z/nZ)* = Z* 


neEN 


(Note that, by the Kronecker-Weber Theorem |24]24.7.2} Q(¢.) = Q”».) 
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Chapter 13 


Arithmetic over Finite Fields 


arith-over-t I'his section is from my final paper in 18.784... need to integrate Our main goal in 
this chapter is to find a way to find the number of solutions for equations over finite fields. 
One problem we will look at in detail is, for a fixed 6, how many solutions are there to 


baylte ty! 


over a finite field? We encapsulate the number of representations as a sum of n dth powers 
in a sum of orthonormal functions on F, called the additive characters y. We consider the 
product 


w= ( Ew) = YS x(yft---+yf). (13.1) 


ye Yl grees Yn €Fg 


(The additive characters have the nice property that y(a + b) = x(a)x(0).) Note (13.1) is 
true for all characters. To extract out the coefficient of .(b), we multiply by x(b), average 
over all distinct characters x, and take advantage of orthonormality to get 


( | 
thesumf’gn(D) = ys | ( - Ww) xO) (13.2) 


qx 


In the next section we will give define and give properties of characters that help us esti- 


mate (13.2). 


1 Characters 


To evaluate it would be helpful if y(y7) = x(y)¢. However, this cannot hold as we 
defined x so that it would preserve additive structure, not multiplicative structure. Thus to 
evaluate we would like to rewrite it as a sum of functions w such that w(ab) = w(a)w(b), 
and such that the set of w are orthonormal. Thus we will need both the concepts of additive 
and multiplicative characters. We make this precise below. 
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Definition 13.1.1: charace Let G be an abelian group. A character of G is a homomorphism 
from G to C*. A character is trivial if it is identically 1. We denote the trivial character by 


Xo or Wo. 


Definition 13.1.2: Let R be a given finite ring. An additive character y : Rt 4 Cisa 
character y with R considered as an additive group. A multiplicative character w : RX > C 
is a character with R*, the units of R, considered as a multiplicative group. 


The two cases we will be working with are R = Z/NZ (Section (1-1), and R = F, 
(Section|1.2). We extend multiplicative characters ~ to R by defining v(x) = 0 for x € R\R*, 
except we follow the convention of setting W(0) = 1 when R = F,. Note that in any case 
the extended w still preserves multiplication. 

We proceed to give an explicit description of characters for abelian groups. First, recall 
the following theorem. 


Theorem 13.1.3 (Structure Theorem for Abelian Groups): Let G be a finite abelian group. 
Then there exist positive integers m1,...,™x so that 


G2=Z/mZ x --- x Z/m,Z. 


Theorem 13.1.4: «a The group G = Z/m,Z x --- x Z/m,Z has |G| characters and each 
is given by an element (r1,...,7%) € Z/miZ x --- x Z/my,Z: 


Xrpyeslhk (ni, aac a) _ Il e ™ 


Moreover the set of characters G form a multiplicative group isomorphic to G.! 


Proof. It is easy to check that y = X7,...,.r, 18 a homomorphism. Let e; be the element 
in G with 1 in the jth coordinate and 0’s elsewhere. Since x(e;)”’ = 1, we must have 
Qrir. 


: 


x(e;) =e ™ for some r;. Each element of G can be expressed as a combination of the e,, 
so this shows all characters are in the above form. 
This shows that (r1,...,7%) > Xry,....r, 18 surjective and hence an isomorphism. 


Corollary 13.1.5: sumcnar Every finite abelian group G has |G| characters. 


Theorem 13.1.6 (Orthogonality relations): om Let G be a finite abelian group and y;, 
1<j <n be all characters of G. Then 


( . 
: 1 — 7 U.- 7#R 
1. (Row orthogonality) (xxi) = eq Drala) = 44 42 
gEG ’ = 


'This is a noncanonical isomorphism. 
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( 
~ 0 h 
2. (Column orthogonality) S> x;(g)x;(h) = 4” g# 
j=l |G, g= h 


Proof. Write Gas Z/m,Z x --- x Z/m,Z. Let (r1,...,7%) and (s1,...,5,%) be in G. Then 


2ni(r;—s5)P5 


k 
rice Vas) = S- Il e mg yayy (13.3) 


(P15--PRIEG I=L 


| k—-1 2mi(rj—sj)py \ Me-l | 
- > (i ae) Se ePaire— sn )Pi | diene (13.4) 


(P1,-+sPr—-1)€G j=l pr=0 


If (Fij2s2s hh) = (Sige Sy) then evaluates to |G]. Otherwise, we may assume without 
loss of generality that r;, 4 s,; then the inner sum in evaluates to 0 by writing it as a 
geometric series. 

The proof for column orthogonality is similar. 


The most useful case of row orthogonality is when we set yz = Xo: 


Corollary 13.1.7: sumo If x is a character of G and x # xo then 


x) =o. 


geEG 


Having established the basic properties of characters of abelian groups, we now turn to 
the specific cases Z/NZ and F,. 


1.1 Dirichlet characters 


For our applications, it is helpful to think of consider characters on Z/NZ as functions on 
Z. From Theorem }13.1.4} the additive characters are simply given by 


2Qriag 


Xa(g) =e % 
Next we consider multiplicative characters. 


Definition 13.1.8: A Dirichlet character of level N is a function y : Z — C that induces 
a group homomorphism 
x : (Z/NZ)* > C, 


and such that x(n) = 0 for any n sharing a common factor with N. In other words, it 
induces a multiplicative character Z/NZ > C. 

We say x is principal if y(n) = 1 for all (Z/NZ)*, and primitive if y does not induce 
a group homomorphism (Z/MZ)* — C for any M < N. 

We say x is even or odd if x(—1) = 1 or y(—1) = —1, respectively; we say y is real 
when im(x) C R and say it is nonreal otherwise. 
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Any character can be written uniquely as a product of a primitive character 1 of level 
M | N and the principal character of level NV: 


X = X1X0- 


1.2 Characters on finite fields 


field-char We give the additive and multiplicative characters on F, explicitly. We know that 
is cyclic; let € be a generator. 


Theorem 13.1.9 (Multiplicative characters of Fj): muc The multiplicative characters of F, 
are given by 


Qrijn 


ue) =e 
for0<j<q-1. 


Proof. By identifying € € FX with 1 € Z/(q — 1)Z, this follows directly from Theorem|13.1.4 


Describing the additive characters takes slightly more creativity, since it is inconvenient 
to decompose F* into cyclic groups. 


Theorem 13.1.10 (Additive characters of F,): Suppose g = p’ with p prime. The additive 
characters of F, are given by 


addVa(g) = ep (a9) (13.5) 


for a € F, where? 
r-1 


tr(g)=gt+g?t+---+9° 


Proof. The automorphisms of F, fixing F,, are generated by the Frobenius automorphism o 
sending g to g?. Since tr(g) is fixed under this operation, it must be in the ground field F,. 
This makes well-defined since only the value of tr(ag) modulo p matters in (13.5). Th 
fact that y, is a homomorphism comes directly from the fact that o is a homomorphism. 
Since v1 (a9) = Xa(9), if Xa = Xv then yi(ag) = vi(bg) and yi((a — b)g) = 0. However, 
x1 is not trivial (identically equal to 1) since there are at most p’~' values of g such that 
g+::-+g? | =0. Thus a= b. This shows all characters in our list are distinct. Since we 
have found |G] characters we have found all of them. 


Remark 13.1.11: In general, a n-dimensional complex representation of a group G is a 
homomorphism p from G into GL,,(C), and the character x of a representation is defined 
by x(g) = tr(p(g)). This coincides with Definition [13.1.1] for abelian G, if we just consider 
1-dimensional representations, since p is multiplication by a constant and y is just that 
constant. 


For the general definition of trace see Definition |14}14.2.1 
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The general case of Corollary {13.1.5]is replaced by the following: every finite group has a 
number of irreducible characters equal to the number of conjugacy classes. The orthogonality 
relations hold when we consider just irreducible characters, and with |G| replaced by the size 
of the centralizer of g in the equation for column orthogonality. 


2 Gauss Sums 


gauss-sums To relate additive characters to multiplicative characters, we need to evaluate sums 
in the form 


euG(P,x) = D> v(y)x(y). (13.6) 


yER* 


where w is a multiplicative character and y is an additive character. 
Suppose we wanted to write an additive character on F, in terms of multiplicative char- 


acters. By row orthogonality, a ae w(y)w(g) equals 1 if y = g and is 0 otherwise. This 
allows us to introduce multiplicative characters as follows: for y € F*, 
1 ae 
x(y) = — DY x9) YS ova) 
gq — 1 x a= 
1 = 
=—z 2 VY) dL Y@)x(9) 
wer* geFg 
1 = 
es, GW, x)v(y). (13.7) 
very 


The Gauss sums are the coefficients of the expansion of y in terms of multiplicative charac- 
ters. The next theorem tells us how to calculate Gauss sums. 


Theorem 13.2.1: egau Let Wo and yo denote the trivial multiplicative and additive characters 
on F,, respectively. Then for multiplicative and additive characters ~ and x on F,, we have 


(qg—1, ~ = Y0,X = Xo 
G(w,x) — 1 =, w= 0,X # Xo 
Ww wv F W0,X = Xo 


and 


IG, x) = Va, YA V0, xX F Xo- 


If w is a nontrivial multiplicative character and y is a primitive additive character on 
Z/NZ, then 
IG, x) = VN. 
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Proof. The first case is trivial. For the second case, 


Givo,x) = dS xy) = (= | —~l=-1 


ycky yeFg 


by Corollary |13.1.7| The third case directly from Corollary |13.1.7) with w. 


Now we consider the case case when w is nontrivial, and either y # xo (in the case 
R=F,) or x is primitive (in the case R = Z/NZ), respectively. We have 


IE, x)? > W(g1)v(92)x(91)x(92) 


g1,g2ER* 


= SY (gi 'g92)x(g2 - 91) 


g1,g92ER* 


= >> dX WA)x(g(h - 1)) setting h = g, ‘go 


hEeR* gic R* 


= Du wir) | VE xah=1)\= xv) 
| 


he R* qmeER yER\R* 


1) ( YS x(a(h - »)] by Corollary [13.1.7] with 


heR* gicR 


Now we note the following: when h = 1 all terms in the inner sum are 1, so it equals q or 
N, respectively. When h ¥ 1, consider two cases. 


1. R=F;,: As g; ranges over F,, g:(h — 1) ranges over F. 


2. R=Z/NZ: As g, ranges over Z/NZ, gi(h — 1) ranges over a subgroup H C Z/NZ, 


hitting each element ral times. Since y is primitive, y|y is nontrivial. 


In either case, Corollary |13.1.7| gives the inner sum to be 0. Hence |G(w, y)|? evaluates to 
V(1)¢q =¢ or W(1)N = N, respectively. 


We will need the following fact later on. 


Proposition 13.2.2: gaussprop Let R = Fy or Z/NZ. For a € R* and be R, 


G(, Xa) = V(a)G(Y, x0). 
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Proof. Using the fact that y.(g) = x1(cg), 


d= v(y)Xa0(y) 


yERX 
d= v(y)xo(ay) 


yeER* 


= S- w(aty)xo(y) replacing y > a~!y 
yER* 


=~(a)* YS v(y)xo(y) 


yER* 


= W(a)G(v, xo) 


Gy, Xab) 


3 Enumerating Solutions 


We return to our original problem. Rather than just work with sums of dth powers, we work 
with diagonal equations 
diag"! +---+ any" = (13.8) 


where a; € F¥ and d; € N. First, note that because of the following lemma, we can restrict 


to case where d;|q — 1. 


Lemma 13.3.1: sa The multisets {y“|y € F,} and {y&47-)|y € F,} are equal. 


Proof. Let € be a generator for F*, and write d = k gcd(d,q — 1) where gced(k,q — 1) = 1. 


q? , 
Then removing the one occurrence of 0 in the two sets, we get {€7|0 < 7 < q—1} and 


fesecd(4a-l)|Q < j < qg—1}. The lemma follows from the fact that as multisets, 


{jd (mod q—1)|0<j <q—1}=({jgced(d,qg—1) (mod g—1)|(0<j<q-1}. 


Indeed, each multiple of gcd(d, q — 1) appears Sateen times on both sides. 


As (13.8) always has the trivial solution when b = 0, we just need to estimate the number 
of solutions to (13.8) when b 4 0. 


Theorem 13.3.2: [?, 6.37]maintum Fix b # 0,d;{q¢ — 1 and let N be the number of solutions 
to (13.8) when b ¥ 0 is fixed. Then 


n—1 


IN — "| < [i -1)--- (de -I) — (1-972) M(dh,..., dala 


where M(dj,...,d,) is the number of n-tuples in the set 


$2 | Gives sdn) € 2" S eS dh 1 and yee. 


i=1 “% 
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Note that we would expect N to be close to q”~', because there are g” possible choices 
for (y1,---,Yn) and q possible values for their sum. 


Proof. We use the idea mentioned in the introduction. We have 


1 1 
1 ; » x(ary + +++ + any’ )X(b) = ; S- x(aryt?) «++ x(any@”)x(0) 


Yiy--Yn€Fq, XEFE Yl ye Yn€Fq, XCF 


since by row orthogonality the inner sum is 1 if ayy@ +-- -+anye” = b and 0 otherwise. Note 
that yo contributes gq” to the sum. Taking it out and factoring the remaining terms gives 


mN = q™? + S- [xe oT p> vou) (13.9) 


xEF tT .xAXo 


We write the sums of additive characters as sums of multiplicative characters using the 
following lemma. 


Lemma 13.3.3: ay» Let y be a nontrivial additive character and \ a multiplicative character 
of order d dividing g — 1. Then 


S~ x(ay") = EX a))G(A’, x). 
ycFq j=l 


Proof. Note that exists since the group of multiplicative characters is isomorphic to Z/(q— 

1)Z by oy ae ee Suppose x = x-. We write x as a sum of eevee characters 
using (13.7), get the Gauss sum to be independent of a by using Proposition [13.2.2| 2| and take 
out ct aa et as we were hoping to do: 


Y= x(ay) = YS xaely 


ye yeFq 


te 2 S” G(X, Xac)) y") 


werx ycky 


eest= = Y FET, xe) T vw)" (13.10) 


yeF 
wer; . 


et = 1+ S°X(a)PG(d’, x) (13.11) 


n= 3 (a)G(A!, x) (13.12) 
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Note (13.11) follows since by Corollary /13.1.7) Dycrx w(y)? = 0 unless 7% is the trivial 
character, which is true iff ~ is a power of A. In that case, the inner sum in (13.10) is q—1. 


In (13.12) we used G(wo, x) = —1 (Theorem 13.2.1). 
2rit 
Using Lemma|13.3.3]and letting A; be the multiplicative character with \,;(€") =e % we 


rewrite (13.9) as 


a 1 n d-1 
N=-qt== YO [x() TP Aj(aj)*G OF, x) 
Ge j=l k=l 
xEFg sxX#XO 
1 1 n 1 
= X(b)Ar" (a1) +++ Xn” (dnG Ar, x) +++ Gn", x) 
q HEE apengg (Rirokn SRSA 1 
1 1 he 
= = > Xe(b)A1 (a1) = Ag (an)G(Ar™, xe) a GO Xe) 
@ CeRX (k1,.4kn),1<ki<dj—1 
1 ZV FR 1 y hn 
lotsofgauss —= — Gir™, Vas) ne Gin, Nas) S- ya(c)ar (c) oS Wig (c) 
F (ky j.uskn) Shi Sdj—-1 ceF® 
(13.13) 
1 1 re 
ee CO Ce CO eae) 2 


where in (13.13) we used Proposition |13.2.2| twice, to get 
—k <—k 


j yi j yi 
Xj? (aj) Oz, Xe) = Ay" (CA? (0) GAG x1) = Ay" ()GOY™, Xa,)- 


Now we apply Theorem |13.2.1|to get that |G(A*", va,) 


= ,/q. Note 
— a (2ni) (SL +.+4n)t 
OG Teh y(t = c(t 


is the trivial character iff (ki,...,kn) € S. Hence ico. . Neo) | = 1 if (hi,. 225 ha) ES 
and ,/q otherwise. Using this and the triangle inequality, (13.14) becomes 


1 n n+l 
IN-g" |< “ls +q°? (dy —1)+++(dn — 1) —[SI)], 


proving the theorem. 


4 Applications to Waring’s Problem 
Now we derive Small’s bound for Waring’s constant g(d,q), the minimum n such that (13.8) 
has a solution with d; = --- =d, = d for all b. By Lemma|13.3.1| g(d, q) = g(ged(d, q—1), ¢), 


so it suffices to consider the case d|q — 1. 
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First, note that sufficient condition for Waring’s constant to exist is that the set {y“|y € 


al 
by €%, and any subfield is multiplicatively generated by € >=. for some 
with p prime we need 
p-l 
p= 


Apply Theorem |13.3.2) (dropping the term with M(d,...,d,)) to get 


n-1 


aboutsamelV = oo = (d = 1)"q 2 


badcond {+d for every proper divisor k of r. 


This is positive when 


Ing 


yyig 7 > (d—1)” — —(Ing—2In(d—1)) > ms 


im 
2 
Thus we obtain the following bound for g(d, q): 


Theorem 13.4.1: msi Suppose dig —1 and q > (d—1)?. Then 


Ing 
= Pls 
g(d,q) $ Ing — 2In(d — 1) | 


F,} is not contained in a proper subfield of F,. Since this set is generated multiplicatively 


k|d, writing gq = p” 


(13.15) 


(13.16) 


(13.17) 


Note that in particular, (13.17) for n = 2 allows us to make the “inverse” statement that 


if ¢g > (d—1)*, then the equation y? + y? = b has a solution for any b € 


F,. That is, for any 


d, in any sufficiently large finite field every element can be written as a sum of 2 dth powers. 
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Rings of integers 


ring-ot-integers When we have a field extension L of Q, we would like to define a ring of integers 
for L, with properties similar to the ring Z C Q. We will define this ring of integers in a 
slightly more general context. 


1 Integrality 


Definition 14.1.1: Let A be an integral domain and L a field containing A. An element of 
x € L is integral over A if it is the zero of a monic polynomial with coefficients in A: 


to” + Oye tees tae tap =0,n > 1, ao,...,dn-1 € A. 
The integral closure of A in L is the set of elements of L integral over A. 


Example 14.1.2: The integral closure of Z in Q is simply Z itself (we see this more generally 
in Proposition [14-1.8). Thus, integral closure generalizes the notion of what it means to be 
an “integer” in other number fields. As we will see in Example |14.4.7| for d squarefree, the 
integral closure of Q(Vd) is Z[Vd] when d = 3 (mod 4) and Z [=e when d= 1 (mod 4). 
2 (eye is much nicer in integral extensions—which is why, for instance, we would study 


Z | rather than just Z[/—3]. 


Theorem 14.1.3: Let L be a field containing the ring A. Then the elements of L integral 
over A form a ring. 


Proof. We give two proofs. We need to show that if a,b are algebraic over A then so are 
a+ 6 and ab. 

Proof 1: Let p,q be the minimal polynomials of a, b, let a1,...,a, be the conjugates of a and 
by,..., 6; be the conjugates of b. The coefficients of 


I] (@-(a@+4,)), II («- (ab;)) 


l<i<m 1<i<m 
l<j<cn l<j<cn 
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are symmetric in the a; and symmetric in the b; so by the Fundamental Theorem of Sym- 
metric Polynomials can be written in terms of the elementary symmetric polynomials in the 
a; and in the b;, with coefficients in A. By Vieta’s Theorem these are expressible in terms 
of the coefficients of p,q, which are in A. Hence these polynomials have coefficients in A. 
They have a+ b, ab as roots, as desired. 


Proof 2: We use the following lemma. 


Lemma 14.1.4 (Criterion for integrality): criterion-tor-integratity An element a € L is integral 
over A if and only if there exists a nonzero finitely generated A-submodule of L such that 
aM C M. If so, then we can take M = Ala]. 


Example 14.1.5: For example, a fails this criterion over Z—multiplying by it has the 


effect of making M “finer.” \/2, however, is integral. 

In the case A = Z and B = Q, a € Q is integral over Z iff a € Z. Indeed, a € Z satisfies 
x —a, and if a ¢ Z, then powers of a contain arbitrarily large denominators so Zia] is not 
finitely generated. 
Proof. =>: If w satisfies a monic polynomial of degree n, then A[a] is generated by l,a,...,a"~". 

<: Suppose M is generated by v1,...,Un. Then we can find a matrix T' with coefficients 
in A such that 

M1] | 
a;i;=T]:]. 


Lon} Lon 


Since v1,...,Un #0, al —T is singular, and det(al —T) = 0. This gives a monic polynomial 
equation satisfied by a. 


Now for a, 8 € L and let M = Ala] and N = A[]. Note 


1. if M,N are finitely generated by {a;} and {8;}, then MN is finitely generated by 
{aghz}: 


2. aaMN C MN and (a+ 8)MN C MN. 


Hence af and a + £ are integral over A by Lemma|14.1.4]as needed. 


For the rest of this chapter, A is an integral domain, K is its fraction field, LZ is an 
extension of kK, and B is the integral closure of A in L. 


ic 2B (14.1) 


k ——A 
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Definition 14.1.6: A is integrally closed or normal if its integral closure in K = Frac(A) 
is itself. 


Proposition 14.1.7: If L is algebraic over K then every element of L can be written as 
® where b € Banda € A. Thus L = Frac(B). In particular, for any extension L/Q, 
Frac(@,) = L. 


Proof. Given a € L, suppose that it satisfies the equation 
P(x) := Gy” + Gn_12™ + +---+ ag =0 


with do,...,@n € K and a, 4 0. Since Frac(A) = K, by multiplying by an element of A as 
necessary we may assume do,...,@, € A. Then 


= x = - os 
a P i = ob dye + GnGyot”™ ? ++ +0? ‘ap. 


Hence a,,q@ is integral over A, i.e. a,a € B. This shows a is in the desired form. 
For the last part, take K = Q and A = Z. 


For short we call the “AKLB” setup if we further assume A is integrally closed in 
K. In the usual case, A is the integral closure of Z in K. in this case, we write A = @x.! 

When F = Q, the algebraic closure of Q, a € Q is called an algebraic number and a € Gg 
is an algebraic integer. 


Theorem 14.1.8 (Rational Roots Theorem): rationat-rootstnm A UFD is integrally closed. 


Proof. Suppose R is a UFD with field of fractions K. Let « € K be integral over R; suppose 
x satisfies 
+ Gye" |) +++++a9=0 


where dg,...,@n,_1 € R. Write x = : where p,q € F are relatively prime. Then multiplying 
the above by q” gives 
p” + Qn—1p" ‘gq ++++ + aipq"| + aoq” =0 
q(dn—1p"* + +++ + agg") = —p 


n 


Thus q | p, possible only if gq = 1. This shows z € R. 


Note that in the definition of integrality, an element is integral if it is the zero of any 
monic polynomial in A[z]. However, it suffices to check that its minimal polynomial is in 
Alq]. 


Proposition 14.1.9: integral-min-poly Let L be an algebraic extension of K and A be integrally 
closed. Then a € L is integral over A iff its minimal polynomial f over K has coefficients in 
A. 


‘Later on, when we take K to be an extension of the p-adic field Q,, we will use Ox to denote the integral 
closure of Z, in K. 
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Proof. The reverse direction is clear. For the forward direction, note all zeros of f are integral 
over K since they satisfy the same polynomial equation that a satisfies. The coefficients of 
f are polynomial expressions in the roots so are integral over A, and hence in A (since they 
are already in Kk). 


Proposition 14.1.10 (Finite generation): fnite-generation 


1. Let AC B CC be rings. If B is finitely generated as an A-module and C is finitely 
generated as a B-module, then C is finitely generated as an A-module. 


2. If Bis integral over A and finitely generated as an A-algebra, then it is finitely generated 
as an A-module. 


Proof. 
1. Take products of generators. 
2. Let algebra generators be (),..., 8m. Then 


AC A[Bi] C++ C Alfa, .--, Bm] 


is a chain of integral extensions, so item 2 follows from 1. 


Combining this proposition with Lemma|14.1.4] we get the following: 


Proposition 14.1.11 (Transitivity of integrality): integraity Let A C B C C be integral 
domains and Kk, L, M be their fraction fields. 


1. If B is integral over A and C is integral over B, then C is integral over A. 


2. Let A’ is the integral closure of A over B and A” be the integral closure of A’ over C. 
Let A” be the integral closure of A in C. 


3. The integral closure of A is integrally closed. 
Proof. 


1. For 7 € C, let b; be the coefficients of the minimal polynomial of C' over B. Then y¥ is 
integral over A[bo,..., bm], so by Proposition {14.1.10] item 2, A[bo,..., bm, Y] is finitely 
generated over A. Since yA[bo,...,5m,y] CG Albi,..., 6m, 7], by Lemma 7 is 
integral over A. 


2. By item 1 applied to A C A’ C A”, A” is integral over A so A” C A”. Conversely, any 
element a € A” is integral over A so a fortiori integral over A”; thus A” C A”. 


3. Follows from item 2 applied to A= B=C. 
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2 Norms and Traces 


Let B be a free A-module of rank n. Then any element ( € B defines an A-linear map mg, 
(or [8]), multiplication by 6. It is helpful to think of ( as a linear map because then we can 
apply results from linear algebra. 


Definition 14.2.1: trace-aet-char The trace, determinant, and characteristic polynomial of mz 
are called the trace, norm, and characteristic polynomial of (. 


These are computed by choosing any basis of €1,...,@, for B over A, and then computing 
the action of 6 on this basis. 


Proposition 14.2.2 (Elementary properties): mm-ciempr The following hold (a € A; 8,5’ € 
B): 


1. tr(8 + 6’) = tr(8) + tr(f’) 
2. tr(a8) = atr(Z) 

3. tr(a) = na 

4. Nm(@6’) = Nm() - Nm(@’) 
5. Nm(a) = a” 


Proposition 14.2.3 (Behavior with respect to field extensions): ni:-ix: Suppose L/K is a 
degree n field extension, MW is a finite extension of L, and @ € L. 


1. (Relationship with roots of minimal polynomial) If f(X) is the minimal polynomial 
of 8 over K and (1,..., 8m are the roots of f(X) = 0 in a Galois closure of K, then 
letting r = [L: K(8)] = 2, 

(a) chary/«(8) = f(X)" 
(b) trzyx(B) =r(Bi +--+: + Bm) 
(c) Nmz/x(@) = (A1--- Bm)” 


2. (Relationship with embeddings) Suppose L is separable over kK’, M is a Galois extension 
of K, and o1,...,@ are the set of distinct embeddings L + M fixing kK. Then 


(a) trejx(8) = 01(8) +--+ + on(8) 
(b) Nmyzyic(8) = 01(8) --- on(8) 


In particular, this is true when L = M is a Galois extension of K, and we can think 
of the o; as simply the elements of G(L/K). 
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3. (Transitivity of trace and norm) Suppose 8 € M and M/K is separable.” Then 


(a) trax (8) = try (ttmy1(8)) 
(b) Nmyyx(8) = Nmp/x(Nmwjz(P)) 
4. (Integrality) Assume AKLB. If 8 € B, then the coefficients of charz/«(@), and hence 


trp/x(G8) and Nmz/x (3), are integral over A. In particular, if A is integrally closed in 
L then they are in A. 


Proof. 


1. Ifr =1,ie. A[G] = L, then by the Cayley-Hamilton Theorem, f(mg) = 0. Since 
f(X) is irreducible, f(X) | chary/«(@). However, these are monic polynomials of the 
same degree so they are equal. 


In the general case, take a basis x; of K[G] over K and a basis y; of L over K[G]. Then 
x;y; form a basis of L over K, and the matrix of mg with respect to this basis is n 
copies of A. This proves (a), which implies the rest of the statements. 


2. Let 61,..., 68m be the conjugates of 3. There are m distinct imbeddings k(3) > M; 
they each take 6 to a different 6,. Each of these imbeddings extend to r := [L : 
Kk (8)] = 2 imbeddings L > M. Now use item 1. 


3. Note that for any finite extensions K C LC N with N Galois, an imbedding L > N 
fixing kK can be extended to a K-automorphism on N, and so be considered an element 
of the set G(N/K)/G(N/L).2 


Let N be a Galois extension containing M. By item 2, 


tru/K(8) = » o() 


o€G(N/K)/G(N/M) 


trp (trmyt(C)) = tresK ( S- “(i 
o€G(N/L)/G(N/M) 


= » = r(a(8)) 


TEG(N/K)/G(N/L) c€G(N/L)/G(N/M) 


where in the second sum we take arbitrary representatives T € G(N/K) and o € 
G(N/L). These are equal because for any choice of these representatives, 


{a € G(N/K)/G(N/M)} = {ra | 7 € G(N/K)/G(N/L), 0 € G(N/L)/G(N/M)} 
when considered in G(N/K)/G(N/M) (i.e. as imbeddings M — N fixing Kk). The 


same is true of the norm. 


?The last condition is not necessary. TODO: Find a proof of the general case. 

3Using the primitive element theorem, write L = K(3). The imbeddings L > N are those taking 6 to a 
conjugate; there are [L : K] imbeddings. But we know G(N/K)/G(N/L) = [L : K], so all of the imbeddings 
must be extendable. We also use this fact (in addition to a counting argument) in the proof of 2. 
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4. The minimal polynomial of a has coefficients in A, by Proposition |14.1.9| Hence the 
result follows from item 1. 


3. Discriminant 


Definition 14.3.1: ais-a: If B is a ring and free A-module of rank m, and 61,..., Gm € B, 
then their discriminant is 


D(Gi,.--; Am) = det[tra;a(GiB;)hi<ij<m: 
Proposition 14.3.2: aisc-basis If the change of basis matrix from 7; to 6; is T, then 
Disa) = Ge Pe Digest: 
Proof. Let M, and My, be the matrices of the bilinear form 
(a, a") = trg/s(aa’) 


with respect to the bases (3),..., 8m) and (71,---,%m), respectively. Then, using the change 
of basis formula for bilinear forms, 


D(61,.--; Bm) = det(M1) 
D(N1,-++,%m) = det(Mp) 
M, = T'M,T 

det (M2) = det(T)? « det(M;) 


from which the result follows. 


Consider the discriminant of an arbitrary basis of B over A. By the above fact, this is 
well-defined up to multiplication by the square of a unit. The residue in A/(A*)? is called 
the discriminant disc(B/A). The discriminant also refers to the ideal of A this element 
generates. 

Note disc(B/A) can be thought of as the determinant of the matrix of the bilinear form 


(8, 8") = traja(BB’). 


Proposition 14.3.3 (Criterion for integral basis): crit-int-basis Let A C B be integral domains 
and B be a free A-module of rank m with disc(B/A) 4 0. Then y1,...,%m € B form a basis 
for B as an A-module iff 


(D(M1,...5%m)) = (disc(B/A)) 


as ideals. 
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Proof. Let 8; be a basis. If the change of basis matrix from y; to 6; is T, then by Proposi- 


tion [14.3.2] 
D(M1,..+;%m) = det(T)? - D(B1,..., 8m) = det(T)* disc(B/A) 


Now 4; is basis iff T is invertible, iff det(T) is a unit, iff (D(11,..-,%m)) = (disc(B/A)). 


Proposition 14.3.4 (Discriminants and Field Extensions): aisc-ana-te 


1. (Relationship with embeddings) Let L be separable finite over K of degree m, and 
01,...,%m be the embeddings of L into a Galois extension M fixing K. Then for any 
basis 3,,..., 8m of L over K, 


D(G4, sss sin) = det(o;8;)? a 0. 


2. (Nondegeneracy of trace pairing) If B is free of rank m over A (with fraction fields 
K, L as above), then the pairing 


(8, 8") + tr(BB") 
is a perfect K-bilinear pairing, and disc(B/A) = disc( K/L) # 0. 


Here perfect means that the map a++ (b+ (a,b)) is an isomorphism L — L*, and similarly 
for bh (a (a,6)). This is equivalent to saying that the bilinear form is nondegenerate. 


Proof. Use Proposition |14.2.3(1b), and that o,,det are both multiplicative. Inequality fol- 
lows from independence of characters: 
Let G be a group, Fa field. Then the homomorphisms G — F™ are linearly independent. 


Thus for K of degree m over Q, we can talk of disc(@x«/Z). 
A closely related quantity to the discriminant is the different. 


Definition 14.3.5: Assume AKLB, and suppose L/K is a finite separable extension. The 
codifferent of B with respect to A is 


B* = {y € L| tr(vy) € A for all x € B}. 
The different of B with respect to A is 
Dax =(B") 
In other words, it is the largest B-submodule satisfying tr(E£) C A. 
Note that Dpja = (B*)t. 
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Remark 14.3.6: two-deraise We will define the discriminant in general, when B is not neces- 
sarily a free A-module, in Chapter The relationship between the two definitions is the 
following: Let p be an ideal in A. Then Ay is a principal ideal domain (in fact, a DVR). Let 
S = A—p; then S~'B is free over S~'A by the structure theorem for modules. We have 
(disc(S-!B/S-1A)) = (pAy)™®) for some m(p). Then 


disc(B/A) =] p™. 
p 


4 Integral bases 


Proposition 14.4.1 (Finite generation of integral extensions): teoic Let A be integrally closed 
and L separable of degree m over K. There are free finite A-submodules M and M’ of L 
such that MC BC M’. Bisa finitely generated A-module if A is Noetherian, and free of 
rank m if A is a PID.* 


Proof. Let {(1,..-; Om} C B be a basis for L over K. Take a basis 6; so that tr(9;8;) = 445. 
Then 
AB, +++: + ABm CBC ASL +--+ + Afr. 


The second inclusion follows because if 6 € B, then writing 6 = 7; b;B., we have that 
b; = tr(38;) € A. (In other words, the 6! form a basis for the codifferent B*, which contains 
B.)Move some of the stuff on codiff? 

If A is Noetherian, then M’ is finitely generated, so its submodule B is finitely generated 
over A. If Aisa PID, then by the Structure Theorem for Modules (over PIDs), M is a direct 
sum of cyclic modules and a free module. Since it is contained in a free module of rank m 
and contains a free module of rank m, it must be free of rank m. 


The following is immediate: 


Theorem 14.4.2: If K is finite over Q (i.e. a number field), then @x is a finitely generated 
Z-module. It is the largest subring that is finitely generated over Z. 


Definition 14.4.3: A basis for @x as a Z-module is called an integral basis. 


Proposition 14.4.4: aisccaic Suppose AK has characteristic 0 (so L separable over Kk), L = 
K[6], and f is the minimal polynomial of 6 over K. Let f(X) = JI(X — 6;) in the Galois 
closure of L. Then 


D(1,B,--,B™ )= TE (6-6)? =(-1)r-Y? - Nmzx(f'(8)). 


1<i<j<m 


This is called the discriminant of f.° 


4Alternative proof: proceed as in 5.8. 
>This gives an alternative proof of the perfect pairing. 
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Proof. Note the £; are conjugates of 3; assume 6 = 6}. 


By Proposition |14.3.4| we have 


1. 2 23 a B 
me a ale ; 
DAB BP Y=). 2. .  . | = TT G-8), 
: Fe . 7 gm=t 1<i<g<m 


where the last statement follows by evaluating the Vandermonde determinant. 
For the second equality, note by Proposition |14.2.3{1c) that 


Nmz/x(f'(8)) = Nmz/K((81 — Bo) ee (Bi = Bn)) = Il Il (2: = B;) 


1<i<m 1<j<m, j#i 


=(-)" = TT (8). 


1<i<j<m 


Proposition 14.4.5: If K = Qla], a € @x, and D(l,a,...,a~') = disc(@/Z) then 
{1,a,...,a’~"} is an integral basis. 


Proof. Using change-of-basis and the correspondence between index and determinant, 
D(1,a,...,a@") = dise(@x/Z) - [Ox : Zlal]?. 
Now disc(@x/Z) € Z so [Ox : Zla]] = 1. 


Theorem 14.4.6 (Stickelberger’s Theorem):  stickeiberger 


1. Let s is the number of complex (nonreal) embeddings K — C. Then 
sign|disc(K/Q)] = (—1)*”. 
2. disc(@x/Z) = 0 or 1 (mod 4). 
Proof. 1. Write kK = Qla] by the Primitive Element Theorem and a;,...,a, be the real 
conjugates and 3), 3;,..., 65,8; be the complex conjugates. By Proposition |14.4.4} 


sign ( D(1/,..13.,0°-")) = sien ( II (4 - Bi | = [ve (ar, 


I<jss 1<jss 


2. Let a1,...,Q@m be an integral basis. Let P and —N be the sum of the terms in the 
expansion of det(o;a;) corresponding to even and odd permutations, respectively: 


P= VY [I cians 


even TES m t=1 


N= YY een: 


odd 7ESm i=1 
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Then 


disc(@%/Z) = det(aia;)? 
=(P—wNy 
=(P+N)*?—4PN. 


Take o € G(K®*"/Q). Note composition by o permutes the o;, say by v. Then 


P= YF [Lf oie-racy 


even TES m t=1 


N= QP [Leen 


odd 7ESp, 1=1 


and hence a permutes {P, N}. Hence o fixes P+ N, PN and they are rational. Since 
they are integral over Z they are integers. Thus the above is congruent to 0 or 1 modulo 
A, 


Example 14.4.7 (Quadratic extensions): quadratic-extensions Any quadratic extension of Q is in 
the form Q(./m) for some squarefree integer m. We find the ring of integers of Q(./m). 
Consider two cases. 


1. m = 2,3 (mod 4): The minimal polynomial of \/m is X? — m, so 
dise(1, /m) = (./m — (—/m))? = 4m. 


Note Taal must be a square by Proposition }14.3.2|so disc(Q(,/m)/Q) equals m 
or 4m. However, by Stickelberger’s Theorem, disc(Q(,/m)/Q) = 0,1 (mod 4). Hence 
disc(Q(./m)/Q) 4 m and disc(Q(./m)/Q) = 4m. By Proposition |14.3.3} 1, /m is an 


integral basis. 


2. m=1 (mod 4): Note 1+Vm is integral with minimal polynomial X? — X — co 


5 , SO 


ro (1 1 tn) _ (3 se _i =) — 


Since m is squarefree, disc(Q(,/m)/Q) = m and Proposition |14.3.3] says 1, t/a is an 


integral basis. 


The following tells us about integral bases for products of fields. Can we generalize from 
Q to extensions of Q? 
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Proposition 14.4.8: aisccompositum Suppose that K, L are field extensions of Q such that 
[KL : Q) =[K : Q\L: Q]. 
Let d = gcd(disc(K/Q), disc(L/Q)). Then 
1. Ox Cd "6K 6,,. 
2. If Gx, = OxG@,, then disc(K L/Q) = disc(K/Q)!" disc(L/Q)*'@. 
In particular, COROLLARY. 


Proof. Let {a1,...,@%m} be an integral basis for K and {(),...,8,} be an integral basis for 
L. By the degree assumption, we know that {a;3;} is a basis for KL over Q. Any element 
of KL integral over Q can be written as 


ay; 
ga-integral’Y = S- 048 (14.2) 


where gcd(r, gcd(a;;)) = 1. 

We need to show that r | d. Let x; = S07, “i 6;. We will turn into a system of 
equations by considering all embeddings K <> C, solve for the x; using Cramer’s rule, and in 
this way show that each x; is an algebraic integer in L divided by a bounded denominator. 

Note given embeddings ox : K ~ C and o, : L © C, there is exactly one embedding 
ox, : KL ~~ C such that restricts to ox and oy. It is clearly unique if it exists. To 
show existence, write K = Q(a) > Q(x) /(f(x)) by PET, and note that the characteristic 
polynomial of f does not change upon passing to L because of the degree assumption. Hence 
KL = L(a) = L(x)/(f(x)), and in extending oy, to oxz, we are allowed to send a = z to 
o1(a). 

Fix an embedding 0 : L <> C, and let oj,...,¢m be all embeddings K ~~ C. Then 
applying o, to [14.2] we obtain the system of equations 


> ox(04) ei = on(y), 1<k<m. 


By Cramer’s rule, letting D = det|(o%(a;)),;] we get Dx; = D; where D; has the ith column 
of D replaced by (o%(a;))%,. Note that D and D; are both algebraic integers. Using 


disc(@x/Z) = D? (Proposition |14.3.4), we get 


Hence disc(@x/Z)«; is an algebraic integer (in @,). Since the {; are an integral basis for 
OG, this forces r | disc(@x/Z). Similarly, r | disc(@,/Z), as needed. 
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Now we prove the second part: Choose (4, ..+;Qm) a basis for a and a eevee: 


Sg SN 


the eoouinsts Of ay Beit in y. Then the mn x mn matrix 


[tr (ai, Fino Bix )| eal (Qi, Binoy, - 


Sf SNe yt SSIS 


SJ Sine SS 


1<j<m 1l<k<n 


~ hss 
t's > (cg, Oy 05), (8:2 Bi, Br) k 
m 
a 


S- 4 Oj Gt a;)5 (Bin Bx, Px) 1 
= [tr(ai, ay )] @ [tr(Fi2 8, )]. 
Taking determinants and using 
det(A ® B) = det(A)" det(B)”, AE Minxm, BE Mnxn 


we get 


disc(K L/Q) = disc(K/Q)#% dise(L/Q)™@%. 


[ADD an algorithm for computing integral bases] 


5 Problems 


1. Suppose that f € Z|z] is irreducible and has a root of absolute value at least 3. Prove 
that if a is a root of f then f(a? +1) 40. 


2. Let a1,...,@, be algebraic integers with degrees dj,...,d,. Let a},...,a), be the con- 
jugates of aj,...,@, with greatest absolute value. Let cy,...,cn, be integers. Prove 
that if the LHS of the following expression is not zero, then 


1 dydg:--dyn—1 
Cja, +...+c,a,| > . 
ol eae ee) 


For example, 


1 3 
\cy + V2 + 3V3| > ( ) : 


|e1| + |2ce| + |2cs| 


3. Let p be a prime and consider k pth roots of unity whose sum is not 0. Prove that the 
absolute value of their sum is at least [-s 
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Chapter 15 


Ideal factorization 


factorization 


1 Discrete Valuation Rings 


Definition 15.1.1: Let K bea field. A discrete valuation on K is a surjective function 
uv: K* — Z such that for every x,y € K%, 


1. m is a group homeomorphism: v(ay) = v(x)u(y). 
2. u(e+y) > min(v(z),0(y)). 
We set u(0) = oo. 
A discrete valuation ring (over Z) is a local integral domain R (not a field), whose 


fraction field has a discrete valuation v. 
An element t with v(t) = 1 is a uniformizing parameter. 


Proposition 15.1.2: Suppose R is a DVR with fraction field kK. Let v be the valuation on 
fe 


1. The units are exactly the elements with 0 valuation: 
="). 
2. Its maximal idea is the set of elements with positive valuation. 
m= {ss o(e) > OU}. 
3. Ris a PID with ideals m” = {x : v(x) > n} = (t") forn EN. 


4. Ris a UFD; any element can be written uniquely in the form ut” where wu is a unit. 
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Lemma 15.1.3: Let A be a local domain with maximal ideal m principal and nonzero. If 
Ano m” = 0 then A is a DVR. 


Theorem 15.1.4: Let (A,m) be a Noetherian local domain. The following conditions are 
equivalent. 


1. Aisa DVR. 


2. A is a normal domain of dimension 1. (Dimension 1 means that the longest chain of 
prime ideals is 2: py C p;.) (Since A is local this means it has only two prime ideals.) 


3. A is a normal domain of depth 1. (There is a nonzero x € A with m € Ass(A/ (z)).) 


4. Aisa regular local ring of dimension 1. (Regular means its maximal ideal is generated 
by a number of elements equal to its dimension. So here it means m is principal.) 


5. m is principal and nonzero. 


Proof. Note (5) ==> (1) uses Krull Intersection Theorem: For R a Noetherian ring, a an 
ideal, and M a finitely generated module (esp. when M = R), then there exists x € a such 
that 

(1+2)() aM =0. 


n=0 


2 Dedekind Domains 


Definition 15.2.1: A Dedekind domain is a normal Noetherian integral domain A such 
that every nonzero prime ideal is maximal. 


Proposition 15.2.2: A local integral domain is Dedekind iff it is a DVR. 


Proposition 15.2.3: For every nonzero prime ideal p in a Dedekind domain A, the local- 
ization A, is a DVR. (Locally, Dedekind domains are DVR’s.) 


(The converse, i.e. if Ay is a DVR for every p, then A is Dedekind, holds using Serre’s 
criterion. ) 


Theorem 15.2.4 (Unique factorization of prime ideals): urceackina Let A be a Dedekind 
domain. Every proper nonzero ideal of A can be written uniquely as a product of prime 
ideals. 


Proof. Let a be a proper nonzero ideal of A. 
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1. If A is Noetherian, then every ideal a C A contains a product b = J] p;* of nonzero 
prime ideals: Otherwise, choose a maximal counterexample a (possible since A is 
Noetherian). Since a is not prime, there exist x,y ¢ a such that zy € a. By the 
maximality assumption both a+ (xz) and a+ (y) contain a product of prime ideals, and 
so does a D (a+ (x))(a+ (y)). 


2. By the Chinese Remainder Theorem 
A/b = ][ A/pe 
k 


via the natural map. 


3. If p is a maximal ideal in a ring A, and q = pA,, then the natural map A/p” —> 
(A/p™),) = Ay/q™ is an isomorphism. (Indeed, it is injective because p is prime and 
surjective because any s € A—p is invertible modulo p”™, on account of (s) +p’ = A.) 


Thus 
TT A/pe' = TD An. /an' 
k k 
(This is where we use the fact that nonzero prime ideals are maximal.) 


4. Combining the above, we get a one-to-one correspondence between ideals in A contain- 
ing b, and ideals in J], Ap,/q;". All ideals in the last ring are in the form [], q;°/q;", 
so a is of the form J], q;". Moreover, different prime ideals containing 6 correspond to 
different T], q;"/q;,", which are different for different s;, giving uniqueness. 


Corollary 15.2.5: urdedekina-cor Let A be a Dedekind domain. 
1. fa=T],p;,° and 6 =[], p;* are ideals in A and p is a nonzero prime ideal then 
ab <> 7r,> 8, forallk 


<= aA, 2 bA,y for all p. 


2. Ifa > 6b ¥ 0 are ideals in A then a = 6b + (a) for some a € A. In particular, if b € a 
then there exists a € A such that a = (a,b); i.e. each ideal is generated by at most 
two elements. 


3. (Inverses) Let a 4 0 be an ideal of A. There exists a nonzero ideal a* such that aa* is 
principal. 


(a) We can choose a* so aa* = (a) for given a € a. 


(b) Alternatively we can choose a* to be relatively prime to a given ideal ¢ £ 0. 


Proof. 1. The forward direction was shown in the course of the theorem. The reverse 
directions are easy. 
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2. Choose any a € a\{0}. By unique factorization, we can write 


for primes p,,...,p, and u; > v; > 0. Now choose b; € par By the Chinese 
remainder theorem we can choose 6 such that b = 6; (mod p) for all 7. Since 
ordy, (bj) = v;, by item 1, the highest power of p; dividing (b) is v;. The highest power 
of p; dividing (a) is u; > v;, so the highest power of p; dividing (a,b) is v;. Now for 
a prime q ¢ {p1,...,p,}, we have a ¢ q (else q would divide a), so q does not divide 
(a,b). We conclude 


(a,b) = py ++ p,”, 
as needed. 


3. (a) follows from item 1; for (b), use item 2 and 3(a) to write a = ac+(a) = ac+aa* = 
a(c+a*). 


Theorem 15.2.6: extension-dedekina Assume AKLB, and K/L is finite separable. If A is a 
Dedekind domain, then so is B. In particular, taking A = Z and K = Q, every ring 
of integers in a finite separable extension of Q is Dedekind. [Separability is not needed. 
TODO: proof of general case, Janusz 1.6.1] 


Proof. 


1. B is noetherian: By Proposition |14]14.4.1| B is a finitely generated A-module, hence 
a Noetherian A-module, hence Noetherian as a ring. 


2. B is integrally closed by Proposition |14/14.1.11(2). 


3. Every nonzero prime ideal q of B is maximal: Take a nonzero ( € q and let its minimal 
polynomial be x” + a@n_12" 1 +---+a,. Then a, = —8"—---—a8 € BBNAC QnA. 
This shows qM A # 0; since A is Dedekind and q/N A is prime, qM A is maximal and 
A/gq is a field. Since B is integral over A, B/q is integral over A/q. 


Lemma 15.2.7: int-com-rcia An integral domain B containing a field & and algebraic over 
k is a field. 


Proof. Let 6 € B be nonzero. Then k|3] is a finite dimensional vector space and the 
multiplication-by-3 map mg : k[B] + kG] is injective, hence surjective. Thus there 
exists 6’ so 68’ = 1, i.e. B has an inverse. 
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The lemma shows B/q is a field. Hence q is maximal. 


Alternatively, this follows directly from “lying-over” and “going up” for integral exten- 
sions. 


Theorem 15.2.8: cicedckina Suppose K is a finite extension of Q. Then unique factorization 
of ideals holds in Ox. 


Proof. Combine Theorem }|15.2.4| and Theorem |15.2.6 


3 Primary decomposition* 


[ADD: Commutative algebra generalization, and a new proof of unique ideal factorization] 


4 Ideal class group 


Let A be a Dedekind domain with fraction field kK. 


Definition 15.4.1: A fractional ideal of A is a nonzero A-submodule of K such that 
da € A for some d€ A. 
A principal fractional ideal is one of the form 


(b) := bA := {bala € A}. 
The product of two fractional ideals is 
ab = {)-a,b\a; € a,b; € b}. 


Note that given a nonzero A-submodule of K, it is finitely generated iff it is a fractional 
ideal. (Take common denominators of the generators. ) 

We can extend unique factorization to fractional ideals, in the same way that we can 
extend unique factorization from Z to Q. 


Theorem 15.4.2: The set Id(A) of fractional ideals is a free abelian group on the set of 
prime ideals. Thus each fraction ideal can be uniquely written in the form 


a=[[p”. 


Proof. Freeness follows from unique factorization (Theorem|15.2.4) and existence of inverses 
follows from Corollary |15.2.5(3a). 


Now we are ready for the following definition. 
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Definition 15.4.3: Let P(A) be the group of principal ideals of A. The ideal class group 
Cl(A) is Id(A)/P(A). Its order is the class number. 

The ideal class group and class number of K are defined as the ideal class group and 
class number of Ox. 


Note that we have an exact sequence 
0 > P(A) > I(A) > CI(A) > 0. 


The class number is 1 iff all A is a PID. Thus in some sense it measures how far A is 
from being a PID. 
Alternatively there is an exact sequence 


1 Of > K* > In > Cle 9 1 
where the map K* -— K is given by a+> (a). 


Theorem 15.4.4 (Approximation Theorem): Let 71,...,2%m € A, and f1,..., Pm be distinct 
prime ideals. For any x € N, there is x € A such that 


ord,, (2 — 24) >n 


for all 2. 


Proof. Immediate from the Chinese Remainder Theorem. 


5 Factorization in extensions 


Assume AKLB, with A Dedekind and L/K finite separable. A prime ideal p C A will factor 
in B: 

pB = bre i pS 
We say e; is the ramification index of B;. For PB | p, we write e(8/p) for the ramification 
index and f(%8/p) for the residue class degree |B/38 : A/p]. 


1. Ife, > 1 for some k, p is ramified in B. 


(a) If g =1 and e; > 1, p is totally ramified. 
(b) When |A/p| = p”, p prime, and p { |[B/8: A/p], then p is tamely ramified. 


2. Ife; = f; = 1 for all 7, p splits completely. 
3. If pB stays prime, p is inert. 


Lemma 15.5.1: aiv-in. A prime ideal 8 divides p iff BOK = p. 
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Theorem 15.5.2 (Degree equation): ces-eq Let m = [L : K] and suppose pB = Bi - - - Bee. 


Then : 
» eifi =m. 
i=l 


If L/K is Galois, then all the e; are equal and all the f; are equal. Letting e and f denote 
these common values, 


efg=m 


Proof. We show both sides of the equation equal dim /,(B/pB). 
For the LHS, by the Chinese Remainder Theorem B/pB & J]#_, B/38f' so 


g 
i=l 
Consider the filtration 
BoP De De. 
There are no ideals between any two consecutive ideals by Corollary |15.2.5} (the first iff), s 


there are no proper B/‘f;-ideals (i.e. subspaces) of 87/37 t'. Hence dims Be; (Bt /Pi**) = 1 
and dim 4/)($7/98;"") = fi. Thus 


deg-eq-2 dim 4/)(B/¥;") = ; fi. (15.2) 


Combining (15.1) and (15.2) give 
dim4/,(B/pB) = Yih 


For the RHS, let A’ = (A—p)~'A = A, and B’ = (A —p)'B. First note that 
A/p = Frac(A/p) = (A/p)p = A'/pA' 
and 
B/p = (A—p)'(B/pB) = B'/pB’, 
where in (*) we use the fact that all elements of A —p are invertible modulo pB, on account 
of A/p being a field. Note A’ is aa DVR and hence a PID. Since B is finitely generated over 
A, and localization is exact, B’ is finitely generated over A’. Furthermore, B’ is A’-torsion 


free. The previous three statements along with the Structure Theorem for Modules gives 
that B’ ~ A’ (as A’-modules) for some n. Perform the following operations: 


B’ ~ Al 
e/pe 
ek 
KS 1 B'/pB' = (A'/pa’y” 
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Hence 
[L: kK] =n=dimy, B/pB 


as needed. 
Now suppose L/K is Galois. Then G(L/k) permutes the primes $8 dividing p. Since 
e(3B/p) = e(oB8/p) and f(8/p) = f(oB/p), it suffices to show G(L/K) acts transitively. 
Suppose by way of contradiction that $B and Q are not in the same orbit. By the Chinese 
Remainder Theorem there exists 6 € Q — {oP | o € G(L/K)}. Now 


Nmzx(8)= [J off)€QnA=pCR, 


o€G(L/K) 


the first because 3 € Q and the second because ( € B is integral over A (which is integrally 
closed in Kk’). But o(3) ¢ $B so 


IL o() €R, 


o€G(L/K) 


a contradiction. 


Note that the ramification indices and residue degrees multiply under field extension. 


Proposition 15.5.3: cemuttipiy Suppose that M/L and L/K are finite separable extensions 
(with Dedekind ring of integers), and that Q | 8 | p are primes in M,L, K respectively. 
Then 


e(Q/p) = e(2/B)e(B/p) 
F(Q/p) = F(Q/B) FB/P) 


Proof. The first comes from substituting the factorization of S8@), in the factorization of 
pO,. The second comes from multiplicativity of degrees of field extensions. 


6 Computing factorizations 


Theorem 15.6.1 (Criterion for ramification): cit-ram Assume AKLB, with L/K finite, A 
Dedekind, and B free over A. (The last condition is satisfied when A is a PID.) Then p 
ramifies in L iff p | disc(B/A). In particular, only finitely many prime ideals ramify. 


Proof. 
1. If A is a ring, B is a ring containing A and admitting a finite basis {e),...,en} as 
an A-module, and a is an ideal of A, then {@,...,@m} is a basis for B/aB as a A/a 
module, and D(é&,...,@m) = D(e1,...,€m) mod a. Hence 


disc(B/A) mod p = disc((B/pB)/(A/p)). 
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Lemma 15.6.2: Let k be a perfect field and B be a k-algebra of finite dimension. 
Then B is reduced (has no nilpotent elements) iff disc(B/k) 4 0. 


Proof. First suppose 6 4 0 is a nilpotent element of B. Choose a basis e; = 2, €2,.-.,€m 
of B. Then Ge; is nilpotent, so has trace 0. The first row of (tr(e;e;)) is zero, so 
disc(B/k) = det(tr(e;e;)) = 0. 


Now suppose B is reduced. By the Scheinnullstellensatz, Mp primep = nil(R) = {O}. 
Since B/p is integral and algebraic over k, Lemma [15.2.7] shows it is a field. Hence p 
is maximal, and different p are relatively prime. Let p,,...,p, be prime ideals of B. 
By the Chinese Remainder Theorem, B/();_, pi = T[j_, B/pi so 


dim, B > dim, (2) (A) rs) = > dim,(B/p;) > r. 


i=1 i=1 
Since dim; B is assumed finite, B has only finitely many prime ideals, say p,,..., py. 


Each B/p; is a finite separable (as k is perfect) extension of k, so by Proposition|14]14.3.4{2) 
(nondegeneracy of trace pairing), disc((B/p;)/k) #0. Since B = B/N, p; = ITZ Pi, 
by taking the union of the bases for B/p;, we get disc(B/k) 4 0. 


3. Let pB = J], 8%’. From the lemma, since A/p is perfect (as it is a finite field), 


disc((B/pB)/(A/p)) = 0 


iff B/pB is not reduced. By the Chinese Remainder Theorem B/pB = J], B/SBF", and 
this is nonreduced iff some e; > 1, i.e. p ramifies. 


Theorem 15.6.3 (Computing the factorization of pB): compute-actpp Assume AKLB, A is 
Dedekind and L/K is separable. Suppose B = Alfa] and f(X) is the minimal polynomial 
of a over K. Let p be a prime ideal in A, and suppose f(X) factorizes into irreducible 
polynomials modulo p as 


i(X) = I gi(X)* (mod p). 


Then . 
pB = [[(p, g:(a))* 


i=1 


is the prime factorization of pB. Moreover, letting g; = g; mod p, 


B/(p, gi(a)) = (A/p)[X]/(%) 
fi = deg yj. 


Generalize to when p relatively prime to conductor. 
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Proof. The map X + a gives an isomorphism 
A[X]/(f(X)) = B. 


Modding out by p gives 7 
k{X]/(f(X)) = B/p. 
This gives a correspondence between ideals in kLX]/(f(X)) and ideals in B containing p: 


Maximal ideals of kLX]/(f(X)) (Gi) 
<— Maximal ideals of B/p (gi(@)) 
<—> Maximal ideals of B containing p (p, gi(a)) 


But the maximal ideals of B containing p are exactly the prime ideals (since B is Dedekind) 


dividing p (Lemma/]15.5.1). 


Now J](g;)* = 0 but no power with smaller exponents is 0. Hence pB D [](p, g;) but 
does not contain any power with smaller exponents, and equality holds. 


Note that the condition that p be relatively prime to the conductor is somewhat pesky. 
The problem is that the we may have prime ideals dividing p that are in the form (p, g(q)) 
where g does has coefficients with elements of p in the denominator. So looking at the 
polynomial modulo p fails to capture this behavior. We can’t look at them modulo a power 
of p either—because then we would not be in a field. The solution is to pass to the completion 
with respect to p—we will do this in Chapter ??. 


Example 15.6.4 (Quadratic extensions): quaa-ext-primes 


Prime p x*+1mod p (p) 
2 (2 +1)? Ramifies: (i + 1)? 
- p=1 (mod 4) factors since (=) = Splits 
p = 3 (mod 4) | irreducible since (+) =-1| Remains prime 
Prime p z* +2 mod p (p) 
2 x Ramifies: (\/—2)? 
- p =1,3 (mod 8) factors since (=) =i Splits 
p = 5,7 (mod 8) | irreducible since (=) =-—1 | Remains prime 
Prime p z*+a2+1modp (p) 
3 (x—1)? Ramifies: ere y 
e p =1 (mod 3) factors since (=) =(2)=1 Splits 
p = 2 (mod 38) | irreducible since (=) = (2) =-1 Remains prime 


Note we used quadratic reciprocity to translate the “square” condition into a modular condi- 
tion on p. This is true in general for any quadratic ring: whether a prime p splits is entirely 
determined by a modular condition on p, because of quadratic reciprocity. 
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Mention some geometrical intuition. 


7 Decomposition and inertia groups 


sec:decomposition-and-inertia Let L/K be a finite Galois extension, with residue fields l and k. 
For a prime p of K, we know that there are three kinds of behavior it could express when 
we pass to L: 


1. It can split into distinct primes 5B),..., By. 
2. The primes have some residue degree f = [@,/8; : Gx /p| over p. 


3. There can be ramification, the primes $8; appearing with exponent e. 


Moreover, |L : K] = efg. We would like to separate these three kinds of behavior by defining 
two intermediate extensions L?(®) and L!‘®), 


Definition 15.7.1: Let 8 | p be primes in L and K. 
The decomposition group of ‘¥ is 


Dijk (B) = {a € G(L/K) : oP) = PF. 
The inertia group of ¥ is 
Tr j«K (8B) = {a € G(L/K) : o(a) —a € # for alla € Gy}. 
Equivalently, letting 1, k be the residue fields of L and K, I;,/« (8) is the kernel of the map 
e: D(%) > G(l/k). 


We drop the subscript when there is no confusion. The main theorem is the following. 


Theorem 15.7.2: cccomposition-andinertia Let L/K be a finite Galois extension with residue fields 
l,k, with 1/k separable.! Let $8 | p be primes of L and K. Let e, f,g be the ramification 
index, residue class degree, and number of prime divisors of p in L. 

Let Bp = POL?) and P; = PN L!®) (the fixed fields of the decomposition and 
inertia groups). Then the following hold. 


1. [L: L7®) =e and ; totally ramifies in L/L/). 
Pre. = ¥. 
2. [L'®) : LP) = f and Bp remains inert in the extension 11) /L?), 


Bo hie = Br 
f(B1/Bo) = f. 


Moreover, L!*)/K is Galois. 


‘Tf 1/k is not assumed separable, then [L : L7‘P)] = e[l : k];, [L7) : LP] = [1 : k],, and [L?™) : L] = g. 
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3. (LP) : K] = g, and p splits completely in L?) if LP) /K is Galois?: 
pO, = B1,p°-: Bgp. 


We have the following picture. By Galois theory, the groups on the right are the Galois 
groups acting on each extension; we set G = G(L/K). 


B L 
iste nacontien = e 1(38) 
Br : Lie) 
met g_—s F DEBTOR) 
Bp : LD) 
totally split if Calis ; : G/D(®) if Galois 
| fe 


Remark 15.7.3: To study ramification, we can define subgroups of /(38) called ramification 
groups and get fixed fields in between L and L!). See Chapter |22| 


The rest of this section is devoted to the proof of Theorem|15.7.2| We keep the notations 


and assumptions in the theorem. 


7.1 Decomposition group 


Proposition 15.7.4: The decomposition group D(8) has order ef, and for 0 € G(L/K), 


Moreover, the following are equivalent: 
1. D(3B) is normal in G. 
2. The groups D(Q) are equal for all Q | p. 
3. LP) /L is Galois. 


Proof. Since D(B) is the stabilizer of $8 under the action of G := G(L/K), |G/D()| is 
simply the size of the orbit of G. This equals g since G acts transitively on the primes 
B1,..., 9, above p. Hence 


_ |G| _n_, 
PO = Tajpepy ~ 9 


?This is actually an iff. Exercise! 
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The second part follows from the fact that if G acts on S and G is the stabilizer of s € S, 
then tGt~! is the stabilizer of ts. 

For the equivalences, use the second part and the fundamental theorem of Galois the- 
ory |12]12.4.1 


We first show that 8p is non-split in L and prove item 3 of Theorem |15.7.2 
By the Fixed Field Theorem, D(8) = G(L/L?™)), and 


decomp-et|L : LP] — | D(9)| = ef. (15.3) 


Since L/L?) is Galois, D(B) acts transitively on the primes of L above Bp. However, 
D() stabilizes $B; thus $B is the only prime above Pp. 
By the degree equation, 


ef =[L: LP) = e(B/Po) f(Po/P). 
By Proposition 


e = e(P/PBo)e(Pod/p) 
f = f(B/Bo) f (Bod/p). 


All equations are satisfied only when e = e($/Bp), f = f(P®/Bo), and e(Pp/p) = 


f(Bod/p) = 1. 
If L?®) is Galois, then e(Bp/p) = f(Bp/p) = 1 are the same as the e and f values for 


all primes in L?®) over L. Thus p is totally split over L. 


7.2 Inertia group 
First we study the homomorphism 
é: D(3B) + G(I/k). 


Proposition 15.7.5: Suppose 8 | p are primes in L and K, and let k and | be the residue 
fields of L and Kk with respect to 8 and p. 


1. 1/k is normal (and hence Galois if separable). 


2. Let ¢ be the map D(B) > G(I/k). Then « is surjective. 


Proof. Let G = G(L/K). 


1. We need to show that for @ € I, its minimal polynomial over k splits completely. Let 
a be a lift to @;, and let 


{Ayes us — o(a)) € Ox|X]. 


Taking this modulo ‘8 gives a polynomial in k[X] containing @ as a root and splitting 
completely. 


Thus //k is normal, and hence Galois if it is separable. 
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2. First note we may assume I/k is separable. Indeed, we have G(I/k) = G(Is°? /k)?. 


It suffices to show that ¢(D(B)) acts transitively on the conjugates of @ over k (as 
then the image has at least [I : k] = |G(l/k)| elements). By the Chinese Remainder 
Theorem, choose a € @;, such that 


_ (a (mod 8) 
10 (mod $'), YA P,P |p. 


Define f as in item 1. Then, noting that for 0 € G\D(B), we have a = 0 (mod o~1()) 
and hence o(a) = 0 (mod $B), 


F(X)= TI X%-c(@)) I] 


o€D(%) o¢D(%) 
= JI (%-e)@) TI eax] 
o€D(%) o¢D(B) 


Now (x) is in k{a], so is divisible by the minimal polynomial of a over k. Given a 
conjugate a’ of a, it divides («), so equals (e(7))(@) for some o. 


Corollary 15.7.6: sesinertia-decomp There is a short exact sequence 
1 I(%) — DB) — G(l/k) > 1, 


i.e. D(®)/I(B) & G(L/k). 


Note I(98) is normal in D(9B) as it is a kernel, so L/‘®)/K is Galois. 
Now we finish the proof of Theorem |15.7.2| The above corollary gives 


ID(B)/TB)| = |GU/k)| = [bk] = f. 


Since G(L7®) /LP)) = | D(9B)/T(B)| = f, we get (LP) : LP®] = f. From (15.3) we get 
[2 i) Se. 
We will apply Corollary |15.7.6}to L/L!). Note 


Diy (B) = Lyyrre (B) = G(L/L*™) = IP) 


since the fact that /(98) operates trivially on //k implies that it operates trivially on //«K(B;). 
Hence the corollary gives 


G(l/«(Br)) = 1, 


3From the Fixed Field Theorem 1/ IGM") is Galois. But 1/18? is purely inseparable and normal. Thus 
we must have | = 1¢/""") i.e. every automorphism of I/k is trivial on 1/I°°?. 
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ie. 1 = K($B;) and f(B/B7) = 1. We know that Bp is non-split in L, so 


e=([L: L'™)| = e(/1) f (B/B1) 


f=(L™ : LPP) = e(%z/Pp) f (B1/Po). 


Now 


e = e(B/Pp) = e(PB/Pr)e(Br/PBd) 
f= f(B/Bod) = fB/B)L(Br/Pd), 


so we must have 


e(B/Br) =e, ppp 
e(P7/p) = 1, f(Br/p) = f. 


This finishes the proof. 


7.3 Further properties and applications 
Theorem 15.7.7: Let M/K be a Galois extension and L/K a subextension. Then 
1. 


Duyji(B) = Dux (B) A G(M/L) 
Tui (B8) = Ivy OB) N G(M/L). 


2. If L/K is Galois, the following commutes and has exact rows and columns. 


1 1 1 
1 >I ust > Lure ——— lijx ——— 1 
1 Duyt Dix ———> Dijx —— 1 


1—+G(M/L) —+G(M/K) —>G(L/K) —>1 


~ ve ~ 


1 1 1 


Theorem 15.7.8: unram-in-compositum Let L/K and L’'/K be finite extensions. Then p unramified 
in L, L’ if and only if p is unramified in LL’. 
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Make notation consistent 


Proof. First we prove the result for L, L’ Galois. Note that for any Galois extension M/K, 
with $8 |p primes in M and kK, 


inertia-1-iftunramified’yy = 1 <=> p unramified in M. (15.4) 
Now there is a injective homomorphism 
®: G(LL'/K) 3 G(L/K) x G(L'/K) 
O(c) = (ol, oly). 
Take Q |p with Q a prime in LL’, and let B= QN CG, and P’ = NN GL. Suppose a € Ig. 


Then o(Q) = Q and hence, taking the intersections with @;, G1, (which are fixed by o since 
L, L' are Galois) 


o|L(B) = PB 
oly (P') = P. 

This shows o|, € Iy,a|z € Igy; by assumption and (15.4), we get (o|z, |x) = (1,1). By 
injectivity of ®, o = 1. This shows Ig = 1, by again, we get Q is unramified over p, 
as needed. 

Now consider the general case. Given 8 | p in ZL and K, let Q be a prime above $B in 
the Galois closure L8"!. Now (L&*!)/2/"/4) is a Galois extension containing L; since L£*! is 
the Galois closure of L, we get 


pst = (ety). 
But (Le! ; (Le!)lo/*"/1)) is the ramification degree of 2/38; we see that it is 1, ie. Q is 


not ramified over $8 and hence not ramified over p. Thus L8"/K is unramified. Similarly, 
L'8"'/K is unramified. By the above, L8!L’8*'/K is unramified, so LL'/K is unramified. 


8 Problems 


sec:factorization-problems 


1. A half-factorial domain (HFD) A is an integral domain where any given factorization 
of a has the same length. Prove Carlitz’s Theorem: 


Theorem 15.8.1 (Carlitz): The ring of integers Ox is a HFD iff the class group has 
order at most 2. 
See AMM, 12/2011, for related results. 


2. Show that if p splits completely in L?®), then L?)/L is Galois. 


Conclude that if p splits completely in L, then p splits completely in the Galois closure 
Le" 
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The class group 


class-group 


1 Norms of ideals 


Assume AKLB, A is Dedekind, and L/K is separable. We generalize the definition of norm 
to ideals, not just elements, so that it is a map Id(B) — Id(A) that is consistent with our 
old condition, i.e. 


Consider a principal ideal p = (p) C A, and suppose it factors in B as pB = J[%_, PB“. We 
want the norm to satisfy 


g 
norm-ideal-motivate Nmz/x (p) = Nmz/x«(pB) = Il Nmz/x«(B)%, (16.1) 
i=1 


since we want it to be multiplicative. But Nm(p) = p” where n = |[L: K]. By the degree 
equation, if Nm(B) = P/ where f; = [B/; : A/p], then (16.1) will be satisfied. Hence we 
make the following definition. 


Definition 16.1.1: For $ is a prime of B, let p = BOA and f(P/p) = [B/P : A/pl. 
Define the norm of 58 to be 


Nmzyic(P) = pr”), 


This extends uniquely to a homomorphism Id(A) — Id(B), since the ideal group is free. 
Proposition 16.1.2 (Behavior with respect to field extensions): 


1. For an ideal a C A, 
Nmz/x«(aB) =a", 


where m = [L: K). 
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2. If L/K is Galois and p ¥ 0 is a prime ideal of A, and $B | p, then 


o€G(L/K) 


3. For any nonzero 6 € B, Nmz/x«(8B) = Nmz/x«(8)A. (Le. this is consistent with our 
previous definition. ) 


Compare the first two items to Chapter[14] Proposition|14.2.2(5) and Proposition|14.2.3{2b), 
respectively. 


Proof. 


1. By the degree equation (Theorem |15.5.2), for p prime 
Nmz/«(pB) = Nmz/x (11 ¥°) = pric =p, 


The general statement follows by multiplicativity of Nmz x. 


2. G(L/K) acts transitively on {P1,..., By}, so each P; occurs * = ef times in {o%P | 
0 € G(L/K)}. 


3. First suppose L/K is Galois. We use the description in terms of Galois conjugates 
to relate the norms of elements with the norms of ideals. By part 2 and Proposi- 


tion 2b), we have 


II o(an) = ( II o()) BM Nin x(8) - B. 
o€G(L/K) o€G(L/K) 


IIS 


Hence, Nmy/«(8)-A and Nmz/x«(-B) determine the same ideal in B. Since Id(A) > 
Id(B) is injective, they are equal in A. 


Now consider the general case. Let M be the Galois closure of L over K, let C = Oy, 
and let d= [M : L]. Then the above, together with part 1 and Proposition}14]14.2.2(5), 
give 


Nmz/x«(G ; Bye Y Nmy/x(G . B) = Nmy/x (8) “A Hee Nmz/x(8)4 A, 


Since Id(B) is torsion-free, Nmz/x(G-B) = Nmz/K (8) - A. 


Definition 16.1.3: The numerical norm of a in @x is its index in the lattice of integers: 
Na = [Ox : al. 
Note the following comparisons between the ideal and numerical norms. 
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1. The ideal norm is defined for a field extension K/F’ while the numerical norm is defined 
for any number field K/Q. 


2. The ideal norm returns an ideal while the numerical norm returns an integer. 


3. However, if we take the base field F' to be Q, and identify integers with the ideals they 
generate, the two norms are equivalent. This is the content of the following proposition. 
Proposition 16.1.4 (Relationship between ideal and numerical norm): 


1. For any ideal a C @x, 
Nmxjo(a) = (3(a)). 
Therefore, St(ab) = N(a)N(b). 
2. Let 6b CaCK be fractional ideals. Then 
[a : b] = N(a*b). 


In other words, the norm of an ideal is its index in the ring of integers. 


Proof. 


1. Write a = J] pj? and let (p;) = ZN pi, fi = flpi/(p:)). By the Chinese remainder 
theorem, 


Ox/a= I] On |p; 


Since Ox /p;' is a vector space over F,, of dimension e;f;, we find 


Na = |Ox/a| = T] pi" = Nmxjo(a). 


Multiplicativity follows from the same property for the ideal norm. 


2. We can multiply by an integer d so that a and 6 are integral ideals. Then 


[a : b] = [da : db] = a | i ~ de © 9/0716). 


2 Miinkowski’s Theorem 


Theorem 16.2.1 (Minkowski): Let V be a subset of R” that is convex and symmetric 
around the origin (“centrally symmetric”). Let L be a full lattice with fundamental par- 
alleopiped D. If 


wT’) > 2" u(D) 
then T contains a point of L other than the origin. If furthermore D is compact, we can 
weaken the hypothesis to 
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Proof. First note that if S is a measurable set such that w(S) > u(D), then S contains 
two points a,b such that a — b € L. Indeed, we can tile the space with fundamental paral- 
lelopipeds, and translate each of them to the origin. We consider the intersections of these 
parallelopipeds with S. Since the sum of these volumes is (5) > u(D), and they are all 
packed in D, there must be overlap, i.e. unequal a,b € S that were translated to the same 
point. This implies a—b € L. 

The set S = $T has volume aT > pu(D). Hence by the above, there exist Sa f sb ep 
(a,b € T) such that $a — $b € L. Since T is symmetric, —b € T; since T is convex, 
5(a—b) € T. This is the desired lattice point. 

Now suppose instead T is convex and p(T’) > 2"u(D). Let L,, be the set of lattice points 
in (1 + +) T other than the origin. By the first part, L,, is nonempty; since T is bounded it 
must be finite. We have that L,, C L,, when n > m. Hence 


LiL= f) (1+-)TnL= () Ln # ¢. 
n=1 n=1 


Theorem 16.2.2 (Sums of four squares): (A digression, but nice to talk about) 


3 Finiteness of the class number 


finite-class There’s a more natural way to “transfer” the inner product on C” to R” x C*... 
We now show that the class number is finite (Theorem |16.3.6). The idea of the proof is 
as follows. 


1. Embed K as a Q-vector space in R” x C*. Under the R-vector space isomorphism K @g 
R — R’ x C®, the ideal a is realized as a lattice L in V = R" x C® (Proposition 16.3.1). 


The norm on K translates into a “norm” on V. 


2. Find an element in a of small norm (Theorem }16.3.2): Find a compact, symmetric 
convex set in V consisting of elements of norm at most R. Choosing R large enough, 


we can make sure V has large volume. By Minkowski’s Theorem, V contains an element 
of L. 


3. Using step 2, show that every ideal class contains an representative of norm at most a 


constant (Theorem |16.3.5). 


4. Show that there are a finite number of ideals with bounded norm (Lemma|16.3.7). 
We first embed a as a full lattice using the embeddings of K, and find the volume of the 
fundamental parallelopiped in terms of the discriminant (the discriminant is related to the 


embeddings by Proposition |14.3.4). 
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Let {o1,...,0,} be the real embeddings and {0,41, O41,---;Ort+s, r+s} be the complex 
embeddings of K. This gives an embedding! 


a: KoORxC 


CG) = (01x inne): 


Identify V = R” x C* with R” using the basis {1,7} for C. 


Proposition 16.3.1: iacatiaitice Let a be an ideal in Ox. Then o(a) is a full lattice in V and 
the volume of its parallelopiped is 2~*- Na- |Ax|?2. 


Proof. Let a1,...,@, be a basis for a as a Z-module. To prove that o(a) is a lattice, 
we need to show o(a;1),...,0(Q@,) are linearly independent, i.e. the following has nonzero 
determinant: 
oi(a1) +++ o(a1) R(or+1(a1)) S(or+1(a1)) 
ss 


A= | (a2) +++ o-(a2) R(or+1(a2)) 


To do this we relate this to the matrix 


oi(a1) +++ Or(an) Or4i(1) Or41 (01) 
B=| o1(a2) ++: or(a2) Or4i(Q2) oOr41(a1) 


1 1 
Note det(B) = +disc(ay,...,n)? #0. Let J = € %, ). Then 
2 
0 
0 
J 


Using 
disc(ai,...,Qn) = [Ox : a]? - | disc(@x/Z)| 


we get that the volume of a fundamental parallelopiped for D is 


u(D) = | det(A)| = 2-*| det(B)| = 27*| disc(ay,...,Q,)|2 = 27° - Na- |Ax|?. 


(In particular, this is nonzero.) 


'This is the canonical embedding K + K @g R: Indeed, by Chinese Remainder 


K 8g R= Q[z]/(f(z)) @gR= ][®2//@- aja) X [[&tl/@ 0,450 (t= tea) = eC, 


i=l 
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Theorem 16.3.2: iacat-representative Let a be a nonzero ideal in Ox. Then a contains a nonzero 
element a of K with 
[Nm(a)| < (2) Eval? 
m(a@ —} —Na . 
ag eg er K 
Proof. The norm on K translates into the “norm” 
Nm(r1, vey Dry Sry ees Pets) _ kan saa |r |[Zr1/? oe Laoxel 
However, Nx < r is by no means a compact convex set. Fortunately, however, we note by 
the AM-GM inequality that 
Die=t [Pal + 2 Vihar [rte 
n 


). a6.) 


normAned-amnam| Nm(x)| = |x1| vss a, | |geaal aS lias |" < ( 


Defining the norm ||-|| on V = R” x C® by 


r s 
Il (x1, a es » Zr-+8) | = > |x; +2 [zal, 
k=1 k=r+1 


and letting B(t) = {x €V: ||z|| < t}, B(Nm,t) = {2 € V: |Nm(z)| < t}, we see from (16.2) 
that t” 
norm-ined-am-ama.D(t) c B (Nm, —) . (16.3) 
nr 


To apply Minkowski we need some computations. 


Lemma 16.3.3: class-volume The volume of Bit) = 12 — V 7 {||| < t} is 
Tr—s gv" 
u(B(t)) = 2° a. 


Proof. We write the complex variables as z, = x, + yRi. Let 
B'(t) = {(21, vey Ur, Xpt 1s Ur41s ++ tte, Mises) = Bit) + U1, + ++, Ur = O} : 
Write dV = dx,---dx,. Using symmetry and a polar change of coordinates, we compute 


votumett(B(t)) = 2° i WV ler Wes dines dirt (16.4) 


(Pr4i wee Prva) dV dpr+1 dO,44 ee dpr+s dO,+.5 
(16.5) 


volume2 


=7 / 
1 ,---€r20,)) ee+2 >) peSt 


= or | nay +S Pig AV dipag WO yd pee, WO eg 
biameo ate ace +1 Prt+ ) Pr+l +1 Prt te 


= 2% (2n)° f saisp boots | AT Gopaee ed ies 
eh) bn tea +1°** Prts) Pr+i Prt 
1 
volume3 = Ors spe(rts)ts_ ‘16% 
“ . ((r +s) +s)! (16.6) 
tr 
— 9-878" 
n! 
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Note (16.4) follows by symmetry, (16.5) follows from polar change of coordinates, and (16.6) 
follows from the lemma below. 


Lemma 16.3.4: 


ay am 


i m 
«4>0,) a 


prt a D(a, +1)---T(am + 1) 
P(ay +--+ +@m+m-+ 1) 


Proof. Making the substitution x; = tx}, dx; = t dz’, we find that the integral equals 
pm ia au | i Perens yale dx, rae AL. 
>0,)> aj<1 


Hence it suffices to prove the lemma for t = 1. 
For m = 1, note 


1 1 I(a+1) 
se = = 
he a+1 IT(a+2) 


For m = 2, let B(a, 8) = fi v*1(1 — v)8 dv. We need to show B(a, 8) = #F@. By 
0 


Fubini, 
= / | st te 8th let ds dt = | : gt 148-Te- +4) ds dt. 
o Jo o Jo 


Note F' : (0,00) x (0,1) + (0, 00)? “ F (u,v) = (uv, u(1—v)) is a diffeomorphism. Indeed, 
it has an inverse F—'(s,t) = (t+ s, is =) hence is bijective and its Jacobian is det (12 “.) = 
u #0. Using the change of variables (s,t) = F'(u,v) gives 


1 foo 1 oo 
| | (wv)*1(u(1 — v))P- te" te) y du du = | | rte le-tye1(1 — vy)? du du 
0 Jo 0 


= (f- gore! du) ([ ve (1—v)F dw) 


=T(a+ §)B(a, 8), 


as needed. 
Now we use induction; suppose the theorem proved for m — 1. We have 


1 
a. 

/ BN Uke ae =| ey, a eee gy Oe ae Ole, 

20,5), Sl wi20,)),-, BiSl—-zm 


= [om (1 — Tin) 1+ Sy as Pla + 1) + Mam-1 + 1) din 
P(ay+++++@m-1 +m) 
_ P@m + 1)P Co: a; +m) T(a,+1)---T(a@n_1+1) 
~ Tar tes: +am+mt])) Ta +-::+am—1+m) 
T(a, +1)---T (am +1) 
D(a, +++++@m+m+4+1)’ 


using the induction hypothesis and the m = 2 case. 
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Taking 


Qn—-r 
t= dnt: -NalAg|2 
ria 


we find by Lemma 16.3.3] that 


t” 
p(B) = 2?n— = 2" (2-*NalAx|?) = 2"u(D) 
where D is the fundamental parallelopiped. Note that B(t) is a closed ball, and it is convex 


by the triangle inequality. Hence by Minkowski’s Theorem, B(t) contains an element of (a). 
For this element, we have by (16.3) that 


t” 4\% n! 1 
N < = NalAxK|2. 
mxja(a) $= (=) =NalAxl? 


nr 


Theorem 16.3.5: iacatctass-group-rep Suppose AK’/Q is an extension of degree n, and let Ax = 
disc(X/Q). Let 2s be the number of nonreal complex embeddings of kK. Then there exists 
a set of representatives for the ideal class group Cl(/’) consisting of integral ideals a with 


Proof. Given a fractional ideal c, there exists 6 such that 
bc = (d) 


is principal. By Theorem|16.3.2} there is an element 6 € 6 of norm at most (4)° “Nb|AK|?. 
Since (3) C 6 we have 


ab = (8) 


for some a. Notea~ 6! ~ c, and taking norms of the above equation gives 


4 : n! 1 
NaN = N(3) < (=) “~No|Ax |. 


Canceling 9tb gives that a is the desired representative. 


Theorem 16.3.6:  ciass-number-fnite The class number of K is finite. 


Proof. By Theorem|16.3.5} every ideal class has a representative with norm at most Cx |Aj| 2, 
Thus it suffices show the following (take C = CeiAgle). 


Lemma 16.3.7: sinite-bounded-norm There are only a finite number of integral ideals a with Na < 
1 
C (take C= CrlAK 2). 
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Proof. Suppose a is an integral ideal. Write a = J] p;’. Let (p;) =p; Z and f; = [@x/p; : 
Z/(pi)|. Then 


Na = [[»'". 


Given Na < C, there are a finite possibilities for the p; and hence p;, as well as for the 7;. 


The bound in Theorem |16.3.5| also gives the following corollaries. 
Theorem 16.3.8: q-+ramises Every algebraic extension of Q ramifies over Q. 


Proof. It suffices to prove this statement for finite extensions. Let A’/Q be a finite extension. 
By Theorem |16.3.2| every ideal contains a representative a with 


1<|Nm(a)| < (4) ™ 


ae 


Hence we have , 
mr 


2s 
dk-bound Ax| > e (=) > 1. (16.7) 


ni2 \4 


The last inequality comes from the fact that defining a, = 7 Cm we have that aj > 1 
H n 
and Cutt — (7)? (1 + +) > 1 for n> 2. 


an 4 


Since Ax > 1 and every prime dividing the discriminant ramifies (Theorem|15.6.1), A/Q 
is ramified. 


Corollary 16.3.9: There does not exist an irreducible monic polynomial f(X) € Z[X] of 
degree greater than 1 with discriminant +1. 


Proof. Let f be an irreducible monic polynomial of degree greater than 1. Let a be a root 


of f. By Theorem |16.3.8} Q[a] is ramified over Q. By (16.7), |Ax| > 1. Then 


disc(f) = disc(Z[a]/Z) = |Ax|- [@x : Zlal]? > 1. 


4 Example: Quadratic extensions 


To compute the class group in quadratic extensions, note the following two facts. 


1. The complete description of prime ideals is given by Example ?? (actually put this 
in!). 


2. By Theorem |16.3.2} each ideal class has a representative of norm at most 4IAR|?. 


In fact, Minkowski’s bound can be improved in the quadratic case. 


143 


Number Theory, §16.4 


Theorem 16.4.1: (*) Let K = Q(Vd) where d is a negative squarefree integer. Let 


( fd gat (mod 4) 


3°? 


“2 d=2,3 (mod 4). 


m 


Every ideal class in @x has a representative a with 
Na < p. 


Proof. First we show that every ideal a has an element a 4 0 with Nmxvg(a) < pOt(a). For 
a lattice L let A(L) be the area of a fundamental parallelogram. 
Note that Nmx/o(z) = |z|?. An ideal a of K forms a lattice in C. Let a be the element 
of minimal nonzero norm in a and b be the element of minimal nonzero norm that is not a 
integer multiple of a. By the minimality assumption, since b—a cannot be a integer multiple 
of a, we have 
lb — al > [b| > la. 


Let A,B denote the points a,b and O the origin. Using the fact that in a triangle the side 
lengths are in the same order least-to-greatest as the opposite angles, we get that in the 
triangle AOB, the angle at O is largest, in particular at least 60°. Let O’ be so that OAO’B 
is a parallelogram. The minimality assumption similarly forces OO’ > AO, AO’, so we get 
ZOAO’ > 60°. Thus 

angie60° < ZAOB < 120°. (16.8) 


Furthermore, the parallelogram with sides OA and OB is a fundamental parallelogram: 
Suppose C’ is the point c € a, and is in the triangle OAB but not any of the vertices. Let 
OC intersect AB at C’. We have ZOC'’B > ZOAB > ZABO = ZC'BO, where the middle 
inequality is from OB > OA. Hence looking at AOC’B, OB > OC’ > OC, contradicting 
minimality of b. Similarly, if C is in ABO’, then we have |a+b—c| < |b], also a contradiction. 

By (16.8), the area of the fundamental parallelogram is 


3 3 
A(Gx) Ita = A(a) = lab] sin ZAOB > apy? = 3 Nimxiala) 


Solving gives 
2 
Nm a) < —A(@x)Ma. 
Kjg(@) S Fa (Gx) 
Finally note that for d= 1 (mod 4), a basis for Ox is (1, Las) while for d = 2,3 (mod 4) 


2 
vd 
2 


the basis is (1, Vd). The fundamental parallelograms have areas and Vd, respectively, 


giving 
Nmxyo(a) < pa. 
Given a fractional ideal c, there exists 6 such that 


bc = (d) 
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is principal. By the above, there is an element b € 6 of norm at most tb. Since (b) C b 
we have 


ab = (b) 


for some a. Notea~ b-! ~ c, and taking norms of the above equation gives 


Canceling Yt6 gives that a is the desired representative. 


NaI = MN(b) < pM. 


We give an example of computing the class group. The general procedure to compute 
the class group of A = OG where K = Q(vd) and d is negative and squarefree is as follows. 


Le 


2 


List the primes p < |]. 


For each p, determine whether p splits in A by checking whether 


is irreducible. 


. If p= aa splits in A, include it in the list of generators. 


. Compute the norm of some small elements (with prime divisors in the list found above), 


like k + 6 for k € No, 6 = Vd or i depending on d (mod 4). Factor Nmx/g(a) to 
factor 


(a)(a@) = (Nmx/@(a)); 


match factors using unique factorization. Note (a) ~ (@) ~ 1. Repeat until there are 
enough relations to determine the group. 


. For the prime 2, if d = 2,3 (mod 4), 2 ramifies, (2) = p?, and p has order 2 for 


d # —1,-—2. (Note p = (2,6) and (2, 1+ 46) in these two cases, respectively.) 


We first consider the cases when the class group is trivial. 


Theorem 16.4.2: The rings 


= —3, —7, —19, —43, —67, —163 


Z[V=1], Z[V—2I, Z A) vd 


2 


are unique factorization domains. 


In fact, they are the only ones (part of Gauss’s class number problem). 
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Proof. Note Z[/—1], Z[|/—2], and Z [4] are Euclidean domains and hence unique fac- 
torization domain. 


1t+vd 
2 


prime integers p < jz, which are the factors of (p) when it splits. When d = 1 (mod 4) as in 


The class group of Z is generated by the classes of prime ideals whose norms are 


all the remaining cases, an integer prime p remains prime in Z [e4] iff x? — x — z(1 — d) 
is irreducible modulo p, iff x? — x — +(1 —d) has no zero modulo p. We show that for 
d = —7,—11,—19, —43, —67, -163, x? — x — }(1 — d) is irreducible modulo all primes less 
than p. Then no prime ideals have norms that are prime integers p < py, and the only ideal 
class is that of the principal ideals. It follows that ZV d| is a principal ideal domain and 
hence a unique factorization domain. 


d lw) w= a? —x+4(1—d) | Primes p < |u|, 2? — 2+ 4(1—d) (mod p) 
-7 |, /z| =1 None 
-l1 | [Y#]=1 None 
-19 | | =2 one cere oe ee eae eee — be 
-43 | B/ =3 2 —7+11 2: 27+2+1=1forz=0,1 
(_ = 
3 2-2-1 Lt tore = 0,1 
1 for 7 =2 
-67 | |V@]=4 x? —x+17 2: 2+2+1=1 fort =0,1 
( _ 
i 1 tore¢= 0,1 
1 for 2= 2 
-163 | /*| =7 xe? —7+4l 2: 22+2+1=1forrz=0,1 
(_ = 
a I fore = 0,1 
1 fore = 2 
(4 for ¢ = 0,1 
5: P—s+1=43 for « = 4,2 
(2 fore =3 
| =i tor = 0,1 
i1 for a = 2,6 
Ve a a, —— 
15 for = 3.5 
id for = 4 


Change for consistent notation. 


Example 16.4.3: We compute the class group of Z[/—41]. 
For d = —41, |p| = \2/4| = 7. Modulo 2, 3, 5, and 7, -41 is congruent to 1, 1, 4, and 
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1, which are all squares. Factor 


(2) = AA 
(3) = BB 
(5) = oC 
in = Bp 


Then the class group is generated by (A) ,(B) ,(C) ,(D). (Note that (A) = (A), etc.) We 
have 
(1+. 6)(T¥8) = (42) = (2)(3)(7) = AABBDD. 


If a prime ideal P divides (1+ 6) then P divides (1+ 6). Hence the conjugate factors are 
divided between (1+ 06) and (1+). Without loss of generality, we can suppose 


(1+6) = ABD. 
The class of a principal ideal is the identity in the class group, so 
(A) (B) (D) = Leet (16.9) 


Next consider 
(2 + 5)(246) = (45) = (3)?(5) = B°B’CT. 


Note that 3 does not divide 2+4 so BB = (3) doesn’t divide (2 +65). Thus B?, B’ divide 
(2+06), (2+) in some order. Since we haven’t distinguished between C and C yet, we may 
assume WLOG that (B)? (C), (BY (C) are equal to (2+ 6) and (2 +0) in some order, and 


or 


(C) = (BY? eva (16.10) 
Similarly, looking at 
(3 + 6)(3 +4) = (50) = (2)(5)? = AAC’, 
we get that : 
(A) (GC) =1 or (A) (CY = 1. 


Noting that A = A (since (2) = (2,1 + 6)(2,1 —6) and (2,1+6) = (2,1—06) whend=3 
(mod 4) by [Artin, 13.8.4]), 
(A) = (CY? 202 (16.11) 


From (16.10) we may omit (C) from the list of generators for the group, from (16.11) we 
may omit (A), and from (16.9) we may omit (D). Thus the class group is the cyclic group 


generated by (B). From (16.10) and (16.11), we get 
(A) = (BY* vava (16.12) 
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Since A is not principal, (B)* 4 1. Note (A) = (A) = (A)~' implies (A)* = 1. Combining 
this with (16.12) gives that (B)* = 1. Since (B)” 4 1 for any proper divisor n of 8 (it 
sufficed to check n = 4), the class group is cyclic of order 8, Cg. 
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The algebra of quadratic forms 


quadratic-forms 
We follow Cox , except for the proof of Gauss composition, when we follow Cassels 
(add reference). The last section is based on Bhargava’s paper [Bha04]. 


1 Quadratic forms 
quadratic-forms1 


Definition 17.1.1: Let R be an integral domain. A quadratic form on R is a function 
on R”, in the form 


F (@iysssy En) = S- AjjpUjL;. 


1<i<j<n 


Supposing FR is a UFD, we say f is primitive iff gcdy<j<j<y, ij = 1. 


A quadratic form may be represented by a matrix 


aj2 Q1i,n-1 ai 
ae aad, | 
ai2 a a2,n—-1 a2,n 
2 22 2 2 
a1,n-1 a2,.n-1 a 1 1 An—1,n 
2 2 fb 2 
l G1,n a2,n an—-1,n a | 
2 2 2 mn 


(working in kK = Frac(R) as necessary to allow division by 2); we have 
f(x) =xQx". 


Definition 17.1.2: We say two forms f and g are equivalent if there are is an invertible 
matrix A (ie. a matrix with determinant a unit) such that 


f(x) = g(xA"). 


We say f and g are properly equivalent if det(A) = 1. 


149 


Number Theory, §17.2 


Note that the matrices corresponding to f and g are related by 
Q; = A'Q,A. 


For the rest of this chapter, we will focus on integral binary quadratic forms, i.e. those 
in two variables over Z. 


2 Representing integers 


Definition 17.2.1: We say that f represents n if there exists x = (x1,...,2,) such that 
f(x) =n. Wesay that f properly represents n if we can choose x so that ged(21,...,%n) = 
1. 


Lemma 17.2.2: p-rp>A form f(x,y) properly represents n if and only if f(x,y) is properly 
equivalent to the form na? + b/ry + c'y? for some b',c’ € Z. 


Proof. If f(p,q) =n with (p,q) relatively prime, then by Bézout we can find r,s such that 
ps — qr =1. Let f(x,y) = ax? + bry + cy?. Then f is equivalent to 


f(px + ry,qz + sy) = f(p,q) x? + (2apr + bps + brq + 2cqs)xy + f(r, s)y’. 
——{—’ 


For the converse, note that na? + bry + cy? properly represents n by taking (x,y) = 
(1,0). 


Theorem 17.2.3: represent-ift square Let n # 0 and d be integers. Then the following are equiv- 
alent. 


1. There exists a binary quadratic form of discriminant d which properly represents n. 


2. d is square modulo 4n. 


Proof. Suppose f is a binary quadratic form of discriminant d properly representing n. Then 
by Lemma |[l7.2.2} f is equivalent to some form nx? + bry + cy?. Hence the discriminant is 
d = b* —4nc, and d= b* (mod 4n). 
Conversely, suppose b? = d (mod 4n), so b? = d+4nc for some integer n, i.e. d = b?—4nc. 
Then 
f(x,y) = na? + bry + cy? 


properly represents n, as f(1,0) =n, and disc(f) = b? — 4nc = d. 


Corollary 17.2.4: correpresent-ift-square Let n be an integer and p an odd prime not representing 


n. Then (=) = | iff p is represented by a primitive form of discriminant —4n. 


Proof. Note (=) = 1 it (=) = 1, and this is equivalent to the second statement by the 
theorem. 
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The results in this section are particularly useful if there are few quadratic forms with 
determinant d. There is a method to list all these quadratic forms, as we will show in the 
next section. 


3 Reduction of quadratic forms 


We would like to have a canonical representative for every equivalence class of binary 
quadratic forms. We choose the one with “smallest” coefficients. This is made precise 
by the following definition. 


Definition 17.3.1: A positive definite binary quadratic form ax? + bry + cy? is reduced if 
it is primitive and 
lb} < ase 
and 
b> Ui |b) =qore =< 


Theorem 17.3.2: one-reducea Every equivalence class of primitive binary quadratic forms con- 
tains exactly one reduced form. 


There’s a more enlightening proof using the action of GL2 on the upper half plane. 


Proof. Existence, Step 1: We first show there is a form in the class with |b] <a <ce. 
Take the form f(x) = ax? + bry + cy” in the equivalence class such that |b] is smallest. 
Note a,c > 0 because the form is positive definite. We claim that a,c > |b]. Indeed, we have 


f(z+my,y) = ax? + (2am + b)zy + (am? + c)y’, 


so —b < 2am +b < 6b for all m € Z, giving a > |b). Similarly, c > |). 
Next, if a >, then replacing (x,y) by (—y,x) we get c> a> |0]. 


Step 2: The form is reduced unless b < 0 and a = —b or a=c. We tackle these cases next. 
In these cases ax? — bry + cy? is reduced, so it suffices to show ax? + bry + cy” are equivalent. 
In these two cases we make the following substitutions: 


f(v,y) =ax* —azyt+ cy? = f(ety,y) = ax" + axy + cy? 
fay) ax? + bry +ay? —> f(-y,2) = ax? — bry + ay”. 


Uniqueness, Step 1: We claim that for (x,y) € Z? with ry 4 0, and f(x,y) = av? +bry+cy? 
with a,c > |b], we have 


f(x,y) = (a — |b| +c) min(2*, y*). 


Indeed, without loss of generality assume x > y. Then 
f(x,y) = (a — |bl)zy + cy’ > (a — |b] + c)y’. 
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As a corollary, for xy 4 0, 
ax’ + bey + cy? >a—|bl| +c 


with equality iff x,y = +1, cy = —sign(b). 
Step 2: To distinguish between reduced forms, we examine the smallest nonzero values at- 


tained by a them, and the number of primitive solutions to them. Note all solutions (2, y) 
with zy = 0 and one of ||, |y| > 2 are removed from consideration. 


1. If |b| < a <c, then the smallest values attained by f primitively are 
a<c<a-—|bl+c 
with solutions (+1,0), (0+1) and +(-—1,sign(b)) respectively. 
2. If b> 0 and |b| = a <c, then the smallest values attained by f primitively are 
a<c=a-—|bl+c 
the first has 2 solutions and the latter has 4 primitive solutions. 
3. If b > 0 and |b] < a=c, then the smallest values attained by f primitively are 
a=c<a-—|bl+c 
the first has 4 solutions and the latter has 2 primitive solutions. 
4. If b>0 and |b| = a =c, then the smallest value attained by f primitively is 
a=c=a-|bl]+c 
which has 6 primitive solutions. 


After examining this data, the only reduced forms that could be equivalent are those falling 
in the first category with opposite b’s, i.e. ax? + bry + cy’. But any change of variables 
sending one to the other must preserve the solutions (+1, 0) and (0, +1), so must have matrix 
(4! £,). If this matrix has determinant 1, then it must be +J and cannot change between 
the two forms. 


Suppose d < 0; note that there is an algorithm to list all reduced quadratic forms with 
discriminant d. The conditions |b] < a < c and b? — 4ac = d give 


b= 6 = dae < ? de? = 3a": 


<,/_4 
a< 3° 


We simply check for solutions to b? — 4ac = d for all 0 < |b] <a < y-!. 


Hence 
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3.1 Examples 


Example 17.3.3: When n = 1,2,3, the above check gives that the only reduced form of 
discriminant —4n is x? + ny?. 

Combining this fact with Theorem we get that f properly represents m iff d := 
—4n is a square modulo 4m, i.e. —1 is a square modulo m. Thus we have the chain of 
equivalences: 


1. f represents m. 

2. f properly represents 7 for some square factor k? | m. 

3. dis a square modulo 73 for some m. 

4. dis a square modulo 7 for the largest square factor k? | m. 
5. dis a square modulo p for every p | m with ord,(m) odd. 


By quadratic reciprocity, we have 


Hence we have the following. 


m represented by | iff every such prime has even exponent in m 
oy p=3 (mod 4) 
x? + 2Qy? p=5,7 (mod 8) 
x’ + 3y? p=2 (mod 3) 


Compare this with the proof using factorization in Z[/—d].‘ In particular, note that 
Z|\/—d] is a UFD when d = 1, 2, and in these cases, there is exactly one form of discriminant 
—4d. This is not a coincidence! 

Next we show the following. 


Example 17.3.4: .25y2 A positive integer n is represented by x? + 5y? iff 


1. Any prime p = 11,13,17,19 (mod 20) appears in n with even exponent. 


‘When d = 3 we have to be slightly careful. 
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2. There are an even number of prime divisors that are p = 2,3,7 (mod 20), counting 
multiplicity. 


3. (No restriction on primes p = 1,5,9 (mod 20).) 


Note this condition is quite different from the ones before! 


Proof 1. This time we have to check a < / —2 < 3. The reduced forms of discriminant —20 
are 

f(a) Sa +5y° 

g(x) = 2x” + Qry + 3y’. 


We run into trouble already: Theorem |17.2.3] fails to distinguish between these. We still 
start with the same argument, though. 


Step 1: By Corollary |17.2.3} a prime p is represented by f or g iff (=) = 1. By quadratic 
reciprocity, 


(=) = (1) (P) = (1, p=1,3,7,9 (mod 20) 
7 1 p = 11,13,17,19 (mod 20). 


Step 2: Now we distinguish between these two cases. By checking modulo 4, we see that 
f only represents primes p = 1,9 (mod 20) (and 5) and g only represents primes p = 3,7 
(mod 20) (and 2).? By Step 1, f, g must represent all of these respective primes. 


Step 3: We have the desired result for primes. How to pass to products of primes? First note 
that primes p = 11, 13,17, 19 (mod 20) have to appear with even exponent (if x? + 5y? = 0 


(mod p), since (=) = —1, we must have p | x,y; now divide x, y by p and repeat). 
Now consider the magical identity 


x25y2-magic( 20” + Qey + 3y”)(2z? + 2zw + 3w?) = (Qeytaw+yz+3yw)?+5(2w —yz)’, (17.1) 


which says that a product of numbers represented by g is represented by f! This immediately 
gives the sufficiency condition. 

For the necessary condition, note we may divide x,y by 2 until they are not both even. 
Now take it modulo 8 to see that n = 1,4,5,6 (mod 8). This gives that item 2 is necessary. 


Wait a minute. Where does the magical identity come from? Historically this was the 
way such problems were solved, and in fact the motivation for composing quadratic forms: 
for primitive quadratic forms f,g,h, we say f og = h iff there exist integral bilinear forms 
B,, Bg satisfying certain conditions such that 


f(x)o(y) = h(Bi (x,y), Bo(x, y)). 


?These sets are disjoint; we say f,g are unique in their genus. 


154 


Number Theory, §17.3 


We won't go into the historical details, because the modern way of thinking of composition 
is cleaner (see Section [5). We know we had the “composition law” 


(a? + b*)(c? + d?) = (ac — bd)? + (ad + bc)”. 
We can view this as coming from the identity 
fermat-id-explained Nmx/g(a@ + bt) Nmxg(e + dt) = Nmxje((a + bt)(c + dt)) (17.2) 
where K = Q(i), so Nmxyjg(z) = |z|?. We now look at a different proof of Example 


Proof 2. This time the complication comes from that Z[/—5] is not a UFD, nor PID; its 
ideal class group has order 2, with representatives 


a= 1 
ba Gaca/a5), 
Step 1: Let p be prime. As in the proof of Theorem|8}8.2.1| we factor the equation x?+5y* = p 


in Z|,/—5] to get 

(2 + V—5y)(a — V—5y) = p. 
Now we know the ideal (p) splits iff x? +5 (mod p) splits, ie. (=) = 1. We calculated that 
this happens when p = 1,3,5,7,9 (mod 20). 


Step 2: So if p is of the above form, we know that either p is a product of two principal 
ideals, or two (conjugate) ideals similar to b. In the two cases, we have respectively 


(p) = A(3, 1 + V—5)A(3, 1 + V—5) 
for some A € Q(/—5). Then calculating the norm of the ideal in K = Q(./—5) gives 


p = Nmx;@() 
p = M((3,1 + V—5)) Nmx7o(A)?. 


Let \ = 4+ y/—5. In the first case, we must have p = x? + 5y?, so p = 1,5,9 (mod 20), 
while in the second case, we must have p = 3(x? + 5y) (x,y € Q, here) so when p is odd, 
p = 3-1,3-9 (mod 20). (We can check that x,y do not have 2 or 5 in the denominator 
by an infinite descent argument, so we may consider x,y € Z/20Z.) p = 2 is possible as 
(2,1+ /—5)? = (2). Thus again we’ve distinguished between the two cases. 


Step 3: A prime p = 1,5,9 (mod 20) splits into two principal ideals, a prime p = 2,3,7 
(mod 20) splits into two ideals of type b, and a prime p = 11,13,17,19 (mod 20) remains 
prime. In order for (n) to split into two principal ideals, we must be able to write 
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where c is a product of ideals, containing an even number of prime ideals of type 6, and ¢ 
contains the conjugates of those ideals. (Two ideals of type 6 multiply to a principal ideal.) 
The result follows. 


It seems like the quadratic forms in the first proof are related to the ideals in the second 
proof. This is indeed the case: we can explain (17.1) similarly to (17.2) by 


Nmxyo(2z + (1+ V—5)y) Nmx/9(2z2 + (1+ J/—5)w) 
M2, 1+ /—5) N(2,1+ /—5) 
_ Nmx/g((2e + (1 + /—5)y)(2z + (1 + V—5)w)) 
M((2)) 


The two forms on the LHS are exactly those on the LHS of while that on the RHS 
can be written in the form Bi + 5Bz because }(2x + (1 + /—5)y)(2z + (1 + V—5)w) is an 
integral ideal. We will see that in this way the group law on ideal classes translates into a 
group law on quadratic forms. 

After we establish Gauss composition, we will show the equivalence between a quadratic 
form Q representing a prime p, and (p) splitting into ideals of a certain form (Theorem|17.5.4). 


The above proof was a specific example of this. 


4 Ideals on quadratic rings 


Definition 17.4.1: We will be considering rings that are free Z-modules of finite rank. 
We call such rings quadratic, cubic, quartic, and quintic, if the rank is 2, 3, 4, or 5, 
respectively. 


The rings we are primarily interested are integral domains, which are exactly the rings 
that can be embedded in field extensions. 


Definition 17.4.2: An order @ in a finite extension K/Q is a subring of K containing 1, 
that is a free Z-module of rank [K : QJ. 


The maximal order of K is simply @x, the ring of integers of K. 


Definition 17.4.3: Let R be a ring that is a free Z-module of finite rank. The conductor 
of R is the greatest integer n for which there exists a ring JT such that 


C=Z4niT. 
(Necessarily, JT’ has the same rank.) 


If S is a quadratic ring then S = (1,7) for some 7 satisfying a quadratic equation 
Tt? + br +c = 0. If this polynomial is irreducible over Z, then S can be embedded in a 
quadratic field extension. Otherwise, S is not an integral domain. We make the following 
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definitions. The first four are equivalent to our previous definitions when S' is integrally 
closed. 


iF 


2 


The discriminant of S$ is the discriminant of the characteristic polynomial, b? — 4c. 


Conjugation is the linear transformation that takes 1 to 1 and switches the zeros of 
x? + br +c. 


. The norm of an element a € S is aq. 
. The numerical norm Mtp(a) of an ideal a € R to be [R: I] =|R/I].° 


. A basis (a, 3) for a C R is positively oriented if 


aa 
pee Lay 


BB 
disc(S) d 


We now describe all quadratic rings. 


Proposition 17.4.4: There is a bijection between D = {d€ Z:d=0,1 (mod 4)} and 
quadratic rings (up to isomorphism), given by 


S:d4 Zr] 


where Tg satisfies a monic quadratic equation with discriminant d. 
Moreover, 


d= fdx, 


where f is the conductor of Z[74] and, when d is nonsquare, dx is the discriminant of Q(T.) 
(dx = 0,1 (mod 4) and 16{ dx). 


i 


2 


An integer d € D corresponds to a integral domain if and only if d is not a square. 


If d=0 then S(d) = Z[z]/(2?). 


. If d is a nonzero square then S(d) = Z- (1,1) + Vd(Z@ Z). 


. If dx =1 (mod 4), dx £1, then S(d) = Z[ fr] = (1, fr) where 7 = Ltevdic 


2 


. If dx = 0 (mod 4) then S(d) = Z| fr] = (1, fr) where 7 = vax the root of the 


nonsquare part of d. 


3For fractional ideals a, ic. R-submodules of R ®z Q, take a fractional ideal 6 containing a and R and 
define Np(a) = 4 


[6:R] ° 
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Proof. Note the map is well-defined, because any two quadratic equations with discriminant 
d, say x? +bj;2+c;, 7 = 1,2, have b; = by = d (mod 2) and hence are related by the change 
of variable x +> x +k for some k. The map is injective because the discriminant doesn’t 
change under replacing 7 with 7 + k. 

For item 1, note d is a square iff the characteristic polynomial factors. Item 2 is clear; 
for item 3 note that we have the homomorphism 


Z{r]/ (1? — d) @ Z[r|/(r — Va) x Zr] /(r + Vd) FZ XZ 
1K (1,0 


TH (vd, —V4d) 


with image Z- (1,1) + Vd(Z@ Z). 
Now write d = f?dx; we will show f is the conductor. Choose b = 0 or 1 with b = d 
(mod 4) and ¢ such that 6? — 4c = d, and let 


S(de) =Z[r)/(7? + br +0) =Z a) 


2 


S(d) = Zr)? + for + fo = [=P EL 


Now S(dx) is the ring of integers of S(d), so the largest quadratic ring containing S(d); 
moreover the above representation gives 


conductor-in-gting ( d) = Z, + fS(dx), (17.3) 


so f must be the conductor. 
Items 4 and 5 come from (17.3) and the fact that Z [—“«] = LZ re |. 


4.1 Proper and invertible ideals 


From now on, assume that d is not a square. We create a bijection between the “ideal 
class group” of a quadratic ring of discriminant d and quadratic forms of discriminant d. 
To do this we first have to define the “ideal class group” of a quadratic ring. This is more 
complicated than defining it for a ring of integers, because a general order is not a Dedekind 
domain. We find that we first have to restrict the ideals under consideration, in order for 
inverses to exist.* Later we restrict the ideals further so that we have unique factorization. 


Definition 17.4.5: arproperiaeai A proper ideal of @ is an ideal such that 
C={Bek:BaCa}. 


(In general we only have C.) 


4Else we only get a semigroup. 
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Note that for the maximal order @x, all ideals are proper, and for any order, all principal 
ideals are proper. Furthermore, any ideal is proper for exactly most one order, namely the 
order {6 € K : Ba Ca}. The following tells us exactly which order that is. 


Lemma 17.4.6:  proper-idcator Suppose a = (a, 3) is an ideal in a order of a quadratic field. 
Suppose T = 2 has degree 2 over Q and satisfies the equation 


ax? +ba +c=0 


where a > 0, b, and ¢ are integers with gcd(a,b,c) = 1. Let K = Q(r). Then a is a proper 
ideal of R := (1,ar), and 
) - Nmx 9(@) 

a 


Mr(a 


As stated this only works for imaginary quadratic fields. 


Proof. Let @ be the order. Now (1,7) is also a fractional ideal of G C Q(7). We know 
GC={B EK: BaCa}. Now, £ is in this set iff 


p< (1,7) 
pre (1,7), 


8 =p+qr for some p,q EZ 
b C 
pr= (p+ ar)r= pr +q(—27-) € (1,7); 


since gcd(a, b,c) = 1, this is true iff a | g. Hence @ = (1, ar). 
For the second part, note 


Proposition 17.4.7: Let a be a fractional @-ideal. Then a is proper iff it is invertible. 
Hence the proper fractional ideals form a group /(@) under multiplication. 


Proof. If a is invertible, then ab = @ for some 6b. If Ga Ca, then 
BO = B(ab) = (Ba)b C ab= 6 


so 8 € @. This shows a is proper. 
Conversely, suppose a is proper. Write a = a(1,7). Letting ax? + br + c be the minimal 
polynomial of 7 with integer coefficients, by Lemma}17.4.6, @ = (1,aT). We show that 


Nmx /9(@) 
a 


aa = C,; 
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it will follow that ——*—@ is the inverse of a. 
Nmxo(@) 


First note @ = @, since G = (1,aT) = (1, a7) (on account of ar + at = —b). Hence a is 
actually an ideal of @. Next, we calculate 


= Nmx/o(@)(1,7,7, 77) 
_ ce 
= Nmx/o(a@) a, T+T,T, -*) 
b 
= Nmx/9(@) (. caer _S r) 
Nmx/g(@) (1,ar) 
a ? 


as needed (using gcd(a, b,c) = 1 in the last step). 


Let P(@) be the subgroup of principal ideals in [(@). Define the class group of @ to 
be 
C(@) = 1(@)/P(@). 


Let P*(@) be the subgroup of principal ideals in the form (a) where a is totally positive, i.e. 
positive under every real embedding. (This is an empty condition if @ is imaginary.) Define 
the narrow class group of @ to be 


Ct(6) = 1(6)/Pt(6). 


(This is an example of what is called a ray class group in class field theory.) 


5 Gauss composition 


g-comp 


Theorem 17.5.1 (Correspondence between ideals and binary quadratic forms): iacat-form-correspondence 
There is a bijection between 


1. narrow ideals classes in quadratic rings with given orientation and 
2. binary quadratic forms (up to proper equivalence), 
given by 


Nmxyo(ax — By) 
Mr(a) 


((1. aay) Z [=+“)) Q(x, y) = ax? + bry + cy’ 


2a 2 


(a = (a, 8),R) 
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where K is the quadratic field containing a, (a, 3) is a positively oriented basis for a, and 
d = b? — 4ac. This restricts to a bijection between invertible oriented ideal classes in the 
order of discriminant d and primitive binary quadratic forms of discriminant d: 


Corollary 17.5.2 (Gauss composition): gauss-composition There exists a group structure on 
equivalence classes of binary quadratic forms, induced by the group structure on ideal classes. 


Proof. Step 1: We show the forward map is well-defined. We need to check two things. 


1. Change of basis gives an equivalent form: Temporarily write Qa, a.(%,y) = a 


Suppose a = (a1, @2) = (b1, 62) where both bases are positively oriented. We can write 


(";,) =f (“.) , A€SL2(Z). 


Then 


ideal-qform-change-basis@) 5, ,b2 (a; y) = = = ae a2 ( (a5 y) A) 


(17.4) 
so the quadratic forms are equivalent. 


2. Multiplying by a totally positive element gives an equivalent form: Suppose J is totally 
positive. Then Nm (A) > 0. First note that (Aa, Adz) is also positively oriented: 


Aa va, ay ay 
Ab, ADL by by 
a = Nmx/9(A) s : > 0 
Then 
— Nm(Aazr — Aagy) 
Ones dds (x,y) —_ Mp (Aa) 
_ Nmx«/o(aix — azy) 
MWre(a) 
= iss (x, y) 
as needed. 


Step 2: We show this map is injective. First note an alternate characterization for the forward 
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map. Writing (a, 3) = a(1,7), we find that the quadratic form corresponding to (a, 3) is 


Qnalasy) = 
_ (ax — By) (ax — By) 
Wr(a) 
_ aux? — (a8 + UB)zy + BBY? 
7 Wr(a) 
Nmx/9(@) 


qform-factored == “Talay —_ Ty) (a a TU) T= ae (17.5) 


Suppose Qay,a2(%,Y) ~ Qb,,b0(#, y). By changing the basis of 6 = (61, 62), which by (17.4) 
corresponds to changing the basis of the quadratic form, we may assume Qa,a.(%,y) = 
Qb,,b.(z,y). The above factorization (17.5) says that one of the following holds: 


2 a = Letting A = i = ra) we find a = Xb. Since both bases are positively oriented, 
ay ay 
a2 a2 
0< So Nmx/o(A), 
by by 
by be 


a2 bo" by, bo” 
Then 
ay ay ay ay 
a2 a2 a2 a2 
0< —t — —1 = —Nm X), 
by by bby Kal) 
by be by by 


giving Nmx/a(A) < 0. But 


Qb1.b. (@, y) = Nr(a 
_ (ax asy) ae — tay) _ , x Giv — Bay) bree — bay), 
el 7 () nC) 


equating gives Nmx (A) > 0, contradiction. 


Step 3: Applying the reverse map and then the forward map gives the identity. 
Given Q(z,y) = ax? + bry + cy? = a(x — Ty)(x — Ty), the reverse map takes it to 
e= (1,7), Note {L7= eta) is in fact a Z-basis for (1,7) over R := Zlat] = Z [4 
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(not just a generating set over @). Indeed, ar(7) = (—br — c) € (1,7). In exactly the same 
way, {1, oT) is a Z-basis for R over R. 
By (17.5), the forward map then takes (a, R) to 


1 


Tala)” —ry)(x — Ty) = [a: B\(x — ry)(« — Ty) = a(x — Ty)(« — Ty). 


Step 4: Invertible classes correspond to primitive forms. Suppose a = a(1,7) is invertible 


and 7 satisfies ax? + bx +c = 0, where gcd(a, b,c) = 1. Then by Lemma|i7.4.6 a= ee 
Hence by (17.5), the quadratic form is ax? + bry + cy”, which is primitive. 

Conversely suppose Q is primitive. Then by Proposition [17.4.6] the corresponding ideal 
(1,7) is proper in R := (1,ar). 

The fact that the discriminant is preserved can be seen from the reverse map. 


Example 17.5.3: exiacr We calculate the binary quadratic form corresponding to the order 
@ of discriminant d. This will be the identity element in the form class group C(D). We 
have @ = (1,7) where 


f ve d=1 (mod 4) 
— : 
ia d=0 (mod 4). 
Then ( 
x? — wv d=0 (mod 4) 
x Nmx/g(x + yT) , 
Qo(2,9) = Nmxjole +y = + xy — <*y?, d=1 (mod 4) 
This is consistent with the fact that x? — 4 and 27+ — vt are the minimal polynomials 


of 7 in the two cases, respectively. 


Theorem 17.5.4: prrep-irtiaea Let a be an invertible ideal in the quadratic ring @ and f 
its associated quadratic binary form. Let m be a nonzero integer. Then the following are 
equivalent. 


1. There exists a’ in the same ideal class as a with 

a’a’ = (m). 
2. There exists a’ in the same ideal class as a with Ito(a’) = m. 
3. f represents m. 


As written this only works for imaginary quadratic fields. For real fields, f may represent 
—m instead. 


Proof. Equivalence of the first two items is clear. We show (2) <> (3). 
Suppose f represents m. Suppose m = d?a, and f represents a primitively. By Propo- 
sition |17.2.2| f is equivalent to a form ax? + bry + cy’. By Gauss composition, this form 
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corresponds to an ideal a’ = a(1,7) with ar? + br +c = 0 inside @ = (1,a7). Hence 
No(a') =a. Then 
Npo(da’) _ da, 


as needed. 
Conversely, suppose Mtg(a) = m. Write a = a(1,7) with Nmxg(a) > 0. Suppose 
at? + br +c = 0 with ged(a, b,c) = 1, so @ = (1, ar) and No((1,7)) = +. The corresponding 
quadratic form is 
Nmx g(x — Ty) 
LS 
He) = MT) 


Since a € @ = (1,aT), we have a = p—qar for some p,q € Z. We have at = pr—q(—bt—c) = 
(p + qb)r + cq; since at € @ = (1,a7) as well, we get wan € Z. Now by Lemma|17.4.6} 


= aNmx/o(x — Ty). 


m = No(a) = Newiale) 


1 
= aNmx/Q(p — gar) 


—bq—p b 
= g | ———,q g(z,y)=9 | —-y —2,y }. 
a a 


We showed above that —“H2 € Z, as needed. (Think of the last step as “root flipping.” ) 


a 


6 Ideal class group of an order 


Suppose @ is an order in the field K, and @x is the ring of integers (the maximal order). 
We want to relate C(@) to C(@x), because the latter is the most “natural” class group for 
K. In reality, we will relate C(@) to a quotient of a subgroup of I(@x), a generalized ideal 
class group of Ox. 

After learning class field theory, which relates generalized class ideal class groups to 
extensions of K, we will see that the primes represented by the quadratic form corresponding 
to @ can be characterized in terms of a certain field extensions L/K. 

CHANGE NOTATION: Replace Id with J and Cl with C in earlier chapters. 


Definition 17.6.1: Define 


Ix(f) = {a € Ix : a relatively prime to fOx} 
Px(Z, f) ={a@x:a=a (mod f@x) for some a € Z} 
Ik(@, f) = {a € I(@) : a relatively prime to fG}. 
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Theorem 17.6.2: ciass-group-o Let f be the conductor of @, i.e. GC = Z+ fOx. There is an 
isomorphism 

In(f)/Pic(Z, f) + 1(@)/P(@) = C(@) 
induced by the map g: Ix(f) > I(@), 


g(a) =an@. 
First, a preliminary lemma. 


Lemma 17.6.3: Let @ be an order of conductor f. Then every @G-ideal prime to f is proper. 


Proof. Cox, Prop. 7.20. Suppose a is prime to f. Thena+f@= @. Suppose Ga C a. Then 
BE = Bla+ f@) = Ba+ Bf Ca+fOxCcé 


so 0 € G. Thus a is proper. 
Proof of Theorem Step 1: We show there is a norm-preserving isomorphism 
Ix(f) > 1G, f) 


aran’é 


Step 2: The map above induces an isomorphism Ix (f)/Px(Z, f) > I(@, f)/P(G, f) 


Step 3: The inclusion I(@, f) — I(@) induces an isomorphism I(@, f)/P(@, f) > I(@)/P(@). 
This follows from Theorem [23]23.1.1 O 


7 Cube law 


We now derive quadratic composition in a different way. We will associate a “cube” of 
integers with three quadratic forms. In order to identify equivalent binary quadratic forms, 
we mod out by SL2(Z)?. After decreeing that the sum of forms making up any cube is 0, we 
find that we have 


1. identified quadratic forms up to equivalence, and 
2. recovered our original composition law. 


Later we will see that these ideas generalize to composition laws for other polynomial forms 
and associated ideals/rings. 

Let Co = Z? @ Z? @ Z?. Choosing a basis (v1, v2) for Z?, every element of C? can be 
written in the form 


AV, ® Vy ® vy + vy ® vq @ vy + Cv @ Vy ® vy + dug ® vo @ vy 
+ €V, © Vy @ Vo + fuy ®@ vg ® Vo + gv @ Vy ® vg + hve ® vo ® vo. 
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We represent this graphically as a cube. 


1 12—————— 122 


A/| a a 
Vai La we 


Think of this as a higher-dimensional analogue of a matrix. Let M;, N; for 7 = 1,2,3 be the 
two matrices obtained by slicing the cube along the 3 possible directions. 


m= (Ea) Mm=( 4) 


Define an action of TP = SL2(Z) x SL2(Z) x SL2(Z) on C2 by letting ({ {) in the ith factor of 
SL2(Z)? act on A by sending 


M; ee r Ss M; = rM; + sN; 
Note that the actions of the 3 factors of SL2(Z) commute, in the same way that row and 


column operations commute for a matrix. 
Now associate a cube A with three binary quadratic forms Q#, Q/, Q# by letting 


Qi(a,y) = — det(Mix — Niy). 


We call A projective if Q+,Q:, Q3! are all primitive. 
Invariant theory gives the following result. 


Proposition 17.7.1: The ring of invariants of Cz under SL2(Z)? is 
(C,)2@" = ZIAiso( A) 
where 


disc(A) := disc(Q1) = disc(Q2) = disc(Q3) 


= S- Ca ene », stuv + 4 S- stuv. 


s,t long diagonal s,t,u,v face s,tju,v regular tetrahedrom 
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(The fact that disc(A) is invariant is easy to see; we shall not need the opposite implica- 
tion.) 

We now prove the bijection in Theorem|17.5.1}and Gauss composition (Corollary |17.5.2) 
in a different way, using cubes. The idea is to associate triples of ideals multiplying to 1 with 
triples of quadratic forms in the same cube (which we will deem to add up to 0), and in this 
way transfer the group structure from narrow ideal classes to classes of quadratic forms. 


Definition 17.7.2: We say that three oriented fractional ideals J,, [>, /3 in a quadratic ring 
S form a balanced triple if 


I, Ip13 C S and 
N(1)NU2)N(Is) = 1. 


We say two balanced triples (1), Jo, /3) and (Jj, 15,5) are equivalent if there are Aj, 2, A3 
such that 


i, = Agli 
Ty = Agli, 
Iz = A3Ih. 


Theorem 17.7.3: There is a bijection between equivalence classes of cubes, and ordered 
pairs (S, (Jy, Jo, /3)) where S' is a quadratic ring and (J), Iz, /3) is a balanced triple modulo 
equivalence. 


Z* @ Z? @ Z?/SL2(Z)* + {(S, (hi, Js, Is))} 


If (a1, 2), ($1, 22) and (71, y2) are correctly oriented bases for ,, Iz, and J3, then the cube 
is given by (@ijx)1<i,j,4<2 Where 
OBI Ve = Cigh + QigeT 


and 7 is such that 


167 


Number Theory, §17.7 


168 


Chapter 18 


Units in number fields 


units-in-nf 


1 Units 


Any finitely generated abelian group is isomorphic to Atos @ Z’ where Ajors consists of all 
torsion elements, i.e. elements of finite order. The number t is called the rank of A. 
The main theorem of this chapter is the following. 


Theorem 18.1.1 (Dirichlet’s unit theorem): a. Let K be a number field with r real embed- 
dings and 2s nonreal complex embeddings. Then the group of units in K is finitely generated 
with rank equal tor +s—1. 


The idea of the proof is as follows. 


1. Following the idea of the proof that the class number is finite (Section [L6]3), we embed 
the set of units as a lattice in R” x R*. Since we want to send a group (under multi- 
plication) to a lattice (under addition), we take logarithms of the norm to define our 
embedding. In actuality, the homomorphism L is not injective, but the kernel will be 
finite, which is good enough. (See Proposition [18.2.2}) 


2. Construct independent units from elements generating the same ideal. We do this by 
finding a,y generating the same principal ideal and taking ay~'. Consider a fixed 
large symmetric convex compact set T’ of R’ x C*, which will contain elements o(a) 
by Minkowski. For a such that L(a) € T, (a) is one of a finite number of principal 
ideals (7%). Then ay," is a unit. 


However, since we want independent units, we look not for points in the form L(a) but 
rather of the form x(a) where x has norm 1. Think of this as “rotating” or “twisting” 
the unit that we get. 


First, a basic criterion for being a unit. 
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Proposition 18.1.2: Let K/Q be a finite extension. An element a € K is a unit if and 
only if Nm(a@) = +1. 


Proof. Suppose a is a unit. Then a~! € K and 
Nm(a) Nm(a~') = Nm(aa™) = 1 
so Nm(qa) = +1. 


Conversely, suppose Nm(a) = +1. Then by Theorem |14.2.3} letting 0, = I,...,0, be 
the distinct embeddings of K to the Galois closure, we have 


a- |] on(a) = Nmzx(a) = +1. 
k=2 


Hence a7! = +[] pf» on(a) € Ox. 


2 Dirichlet’s unit theorem 
We now prove Dirichlet’s unit theorem. 


Lemma 18.2.1: pouna-cegree-norm There are a finite number of algebraic integers a such that 


[(Qla): Ql) <m 


|a"| < M for all conjugates a’. 


Proof. The second condition means that the coefficients of the minimal polynomial f are 
bounded. Since the degree of f is at most m, there are a finite number of possibilities for 
the f and hence a.! 


Let {o1,...,0,} be the real embeddings and {0,41, O41,---;Ort+s,Fr+s} be the complex 
embeddings of K. Since 


Nm(q) = |o1(@)|-+ = |or(@)|lor41(@)|? --- |or+s(@)/?, 
we define the homomorphism 


L:K* +R" 
L(a) = (Injor(a)|,.--,In |or(@)|, 21m Jorsa(a)]--+ ,2In Jor+s(@)]). 


This is the composition of our previous embedding o with f: 


o:kK >3RxC a(a) = (o1(a1),..., or(ar)) 


FoR” xR SR Figen hp Seige Boag) =] (pens 5 le Be Peed | ye eg 21 Beal) 


'See Chapter |39| for... 
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Proposition 18.2.2: iucinn The image L(Ux) is a lattice contained in the hyperplane 
4 (Piers pes) PE ee SO 

Moreover, L has finite kernel. 

Proof. If L(u) = (21,...,;%r+s) € Ux then 


ty t-+++ 2,45 =In|oi(a)| +---+Injo,(a)| + 2In|o,41(a@)| +--+ +21n|o,4,(a)| 
= In|Nm/(a)| = 0. 


To show L(Ux) is a lattice it suffices to show it is discrete. To this end, we show the 
base elements 
B(r) = {(x1,-.-, rts) : |zj| < Ch 


centered at the origin contain finitely many points of L(Ux). Indeed, if g(a) € B(r), then 
|o,(a@)| < C for every embedding o;. By Proposition there are a finite number of 
possibilities for a. 

If a € ker L, then |ox(a)| = 1 for all k. Again by Proposition [18.2.1] there are a finite 
number of possibilities for a. 


Since Ux is abelian, we now know 


Ux =ker(L)@ L(UK) . 


—~— —_~— 
Lor lattice of H 


It remains to show the following. 


Lemma 18.2.3: L(Ux) is a full lattice in H. Therefore it has rank r + s — 1. 


Proof. Let x € R" x C. By Proposition|16.3.1| the volume of the fundamental parallelopiped 
of o(a) is 2-*-Na-|A,x|2. Note that multiplication by x multiplies the norm by Nm(z) (more 


precise here?) so the volume of the fundamental parallelopiped of o(a) is Nm(x)2~*Na-|Ax|2. 
Now suppose x is any element such that Nm(x) = 1. Let V = 2-*Na- |A,|2. Let T be 
any compact convex symmetric set with volume at least 2"**V. We note the following. 


1. By Minkowski’s Theorem, there is point of T in the lattice x -0(@x). 


2. Since T is bounded, all elements of T have norm bounded by a constant C.. If o(a) € T, 
then a has norm bounded by C.. By Lemma there are a finite number of 
principal ideals with norm bounded by C, say (71),.--,(%m). Then if o(a) € T, we 
have (a) = (4), ie. @ = uy, for some unit u, and some k. 


In conclusion, for each x we find a such that 


T > xo(a) = xo(uyx) for some k, 
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Le. 
s-sigmawX0 (1) E U oy, VE: (18.1) 
k=1 
Since T is bounded, so is U7, 7(y;,'). There exists C’ so that every coordinate of xa(u) is 
less than C’: 
x-sigmaua(xo(t)), < ic. (18.2) 


The idea is that this places a large constraint on the possibilities for ¢, so as we vary x 
between “extreme” values, we will have to get linearly independent uw. 


Take 
— c= — Cee) 


’ ’ Cirts—1 , 
k 
Then letting u; be such that (18.1) holds for x,,u,, we get by (18.2) that, componentwise, 


Cp) eel decal), 
Le. 
Lt) SF og) (Oe cf I 0, Oe 
Note the following. 


1. Every entry of L(u,) is negative except for the kth one, which must be positive because 
the entries sum up to 0. 


2. The sum of entries of L(u,), omitting the last term, is positive. 
The following lemma will show that L(u;),...,£(u,1s-1) are linearly independent. It will 
follow that u,,...,U,1s—1 generate a free abelian group. This means rank(Ux) >r+s— 1; 
we have equality by Proposition |18.2.2}since dim H = r+ s-—1. 
Lemma 18.2.4: Suppose that A is an x n matrix such that 

1... a3 < Ofort #7 and a,; > 0: 

2. am Aig > 0. 
Then A is invertible. 


V1 
Proof. Suppose v = { : } is a nonzero vector. Suppose 7 is such that |a;| is greatest. Then 


looking at the ith component gives isl aijv; = 0. Then 
S- AjjU; > AijVi + S> AjjUi > 0, 
a1 i#i 

so Av £0. Thus A is invertible. 


This finishes the proof of Dirichlet’s Unit Theorem. 
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3 S-units 
Definition 18.3.1: Let S be a finite set of prime ideals of kK. The ring of S-integers is 


Ox(S) = (\(@x)p = {a € K : ord,(a) > 0 for all p ZS}. 
pgs 


I.e. we allow dividing by elements whose “only prime factors” are in S$. The group of S-units 
is the group of units in Ox(S): 


U(S) = 6x(S)* ={a € K | ord,(a) = 0 for all p ¢ S}. 


There are more units in U(S) than in Ux; the following generalization of Dirichlet’s theorem 
says that we get an “extra” unit for every prime in S. 


Theorem 18.3.2 (Dirichlet’s S-unit theorem): asu: The group of S-units is finitely generated 
with rank r+ s+ |S|—1. 


Proof. Let S = {pi,..., pz}. Consider the maps 
Ux 3 U(S) > Z™ 


where 
wa) = (ord, (@) cei, Ordy_(7)): 
Its kernel is Ux, as the elements of Ux are exactly those x with order 0 for every prime p, 


and by definition ord,(x) = 0 for x € U(S) and p outside of S. Let h be the class number of 
Kk. Then pf = (ax) for some az. We have 


Hence y(U(S)) is a full lattice in Z™. Since Ux has rank r + s — 1 by Dirichlet’s Unit 
Theorem (18.1.1), U(S) has rank r+s—1+m. 


4 Examples and algorithms 


5 Regulator 
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Chapter 19 


Cyclotomic fields 


cyclotomic 


1 Cyclotomic polynomials 


Definition 19.1.1: A cyclotomic extension of Q is a field Q/¢] where ¢ is a root of unity. 
We call ¢ a primitive nth root of unity if ¢” = 1 but ¢" 41 for0O<m<n. 


We will use ¢, to denote a primitive nth root of unity. 
The nth cyclotomic polynomial is defined by 


oie) = Il (c-—en ) 


O<j<n,ged(j,n)=1 


Equivalently, it can be defined by the recurrence ®,(x) = 1 and 


a | 
eee ®,,(x) , 


®, (x) = 


Hence, it has integer coefficients. 
Theorem 19.1.2: cyciotomicirreducibie The cyclotomic polynomials are irreducible over Q/z]. 


Proof. We need the following lemma: 

Suppose w is a primitive nth root of unity, and that its minimal polynomial is g(a). Let 
p be a prime not dividing n. Then w? is a root of g(x) = 0. 

Since ®,,(w) = 0, we can write ®, = fg. If g(w?) #0 then f(w?) = 0. Since w is a zero 
of f(x”), f(x?) factors as 


f(a?) = g(a)h(a) 


for some polynomial h € Z[z]. 
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Now, in Z/pZ|a| note (f; +...+ fx)? = fP +... + fP since the pth power map is an 
homomorphism. Hence 


g(a)h(x) = f(a?) = f(x)? (mod p). 


Hence f(a) and g(x) share a factor modulo p. However, the derivative of x” — 1 modulo p is 
na”! £0, showing that 2” — 1 has no repeated irreducible factor modulo p; hence ©, has 
no repeated factor modulo p. Since ®,, = fg, this produces a contradiction. 

Therefore g(w”) = 0, as needed. 

Any primitive nth root is in the form w* for k relatively prime to n. Writing the prime 
factorization of k as p,---Pm, we get by the lemma that w?!, w?!??,...,w?P!P™ are all roots of 
g. Hence g contains all primitive nth roots of unity as roots, and ®,, = g is irreducible. 


Theorem 19.1.3: cyclotomic-degree 


[Q(cn) : Q = y(n). 


Proof. The minimal polynomial of ¢,, equals the cyclotomic polynomial by Theorem {19.1.2} 
the latter has degree y(n). 


We use cyclotomic polynomials to prove a special case of Dirichlet’s theorem. 


Theorem 19.1.4 (Dirichlet’s theorem for p = 1 (mod n)): airicitesi(T) Let n be a positive 
integer. There are infinitely many primes p with p=1 (mod n). 


Lemma 19.1.5: For any integer m, all divisors of ®,(m) either divide n or are 1 (mod n). 
Proof. Suppose p is prime and p | ®,(m). Then p | m” — 1, ie. 
m" =1 (mod p) 


so r :=ord,(m) | n. Since m?~! = 1 (mod p) by Fermat’s little theorem, r | p — 1. 
Ifr =n, then n | p—1, i.e. p=1 (mod n). Suppose that r <n. Then 


m”—1 


Pp | ®,,(m) | mG») oe a ee 


m—1 


However, m" = 1 (mod p) so 


mEDs..tm41st (mod p), 
(4 


sop|®|n. 


Proof of Theorem|19.1.4, Suppose by way of contradiction that only finitely many primes 
are 1 (mod n). Let their product be P (if there are no such primes, P = 1). Consider 
®,(knP), k € Z. Since it divides (nP)" — 1, it can’t have prime divisors in common with n 
or P. With appropriate choice of k we can be sure ®,(knP) 4 0,+1. By the claim all prime 
divisors of ®,,(knP) are 1 (mod n), but they don’t divide P, contradiction. 
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2 Ring of integers 


Our next two propositions will give us information about the ring of integers of Q|¢], as well 
as some other useful facts. In the process we will rederive Theorem |19.1.3 


Proposition 19.2.1: cyctotomicunis Suppose ¢ and ¢’ are primitive nth roots of unity. Then 


ag is a unit in Z[¢] = Z[¢']. 


Proof. Then we have ¢’ = ¢* and ¢ = ¢" for some s,t, so Z[¢] = Z|¢’] and 


yee ea a 

ee ac ee 

i ! ore nt—1 

fae tet +¢°" € ZC). 
Therefore cag is a unit in Z[¢]. 


Proposition 19.2.2: cyciotomicp Let p be prime and r € N. Suppose p” > 2, let Gr be a 
primitive p’-th root of unity, and let K = Q|G,r]. Then 


1. [Ql¢pr] : Q| = yp") = p"(p — 1). 
2. The element 7 = 1 — Gp is prime in Ox, and (p) = (1) *"). 
3. Ox =Z2|Grl. 


Ppp 


A. disc(Ox/Z) = (-1 pP 


Proof. By Proposition |19.4.1} 


pal4+ XP 4. XO 
= ®,r(1) 
= II (i=) 


¢’ primitive p’th root of unity 


_ Il (1 _ Cpr) 


¢’ primitive pth root of unity ~ Cpr 


= u(1 — ¢) rr") 


where u = Je primitive p"th root of unity ae is a unit by eee oe Thus (p) = (1)?"). 

From the degree equation (Theorem [15]15.5.2), we get that [Q[¢] : Q] = y(p") with strict 
inequality when 7 factors further. On the other hand [Q/¢] : Q] < y(p") since the cyclotomic 
polynomial has degree y(p"). Hence equality must hold, and 7 must be prime, giving (1) 
and (2). 


pr-r-1)_ Thus p is the only prime ramifying in Q{G,°]. 
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The degree equation for (p) = (7)*") reads 


so we must have f((7)/(p)) = 1, ie. the natural map 
eit Z/(p) — Ox (1) (19.1) 


is an isomorphism. 


We first calculate disc(Z[¢]/Z). By Proposition [14]14.4.4| 


disc(Z[Cpr]/Z) = + Nmg¢e,r)/o(®}r(¢)) 


', (6) = (4) 


x=¢ 
PRA a1) 1p a 
—— r—1 _ 2 
Car) eee 
DG BG 


where we set ¢, = ¢?"~". this is a primitive pth root of unity. We calculate the norm of each 
factor. 


1. Nmg(¢,)/q(p") = (p?) Or) = pre *@-0, 
2. Nmg,)/a(G) = £1 since ¢;" is a unit. 
3. Nmgv¢,)/a(¢p — 1) = p”’: The minimal polynomial of Gp — 1 over Q is ©,r(X +1), 


whose constant term is ®,(1) = XP? '@-) 4... 4+ XP" +4 1]x-1 = p. Hence by 
Proposition |14]14.2.3(1c), we have 


y 


: (p") es 
Ning«,)/o(p a1 (+p) 2Gor):2Co)] =+p ro) = +p? a 


Combining these we get 


i DG rolippope 
disc-cyclotomic-p disc(Z[¢,r]/Z) = Nmgvc,r)/@ ia = i = pp" = ye (p » (19.2) 


By Proposition (fix this a bit), we have 
pr) — disce(OK /Z) = (Ox : Z[Gr])? disc(Z[C]/Z). 


Hence both factors are powers of p up to sign. Since (Ox : Z[¢,r]) is a power of p, the 
quotient module is annihilated by a power of p, i.e. then 


soem Ox Ss Z|Cpr | (19.3) 
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for some m. Note surjectivity in gives Ox = Z+7@ x and hence 
ocx = Z|Gr| + 16x. (19.4) 
Suppose Ox = Z{G,r| + 7"@x. Then substitution into gives 
Ox =U Gr] + 16x = Z[Gpr] + 1(Z[Gpr] + 2" Ox) = Z[Gr] + "1" Or. 


Hence Ly induction, Ox = Z[¢,-] + t”@x for all n. However, (p) = (7)? so this means 
Ox = Z|Gr|+p" Cx - alln. Taking n = m, (19.3) gives Ox = Z{¢pr], proving (3). Together 
with ( (19.2), this gives (4). The second part of (4) now follows from Theorem [15]15.6.1] (A 
prime ramifies if and only if it divides the discriminant). 

All embeddings of Q(¢,,) are complex, and there are y(n) = [Q(¢,) : Q| of them. By 


Theorem 14] 14.4.6(1), the sign is (—1)?®"). 


Now we prove the analogous result for Q(¢,,), for any n € N, by taking compositums of 
fields of the form Q(G,r). 


Theorem 19.2.3: Let n,r € N with n 4 2 (mod 4)!. Let ¢, be a primitive nth root of 


unity. 
1. (Qn) : Q) = y(n) 
3. 
( 1)*Pne™ 
disc(Ox /Z) = nee 
pin PP? 
Moreover, 


1. If p ¥ 2, then p ramifies iff p | n. 
2. If p = 2, then p ramifies iff 4 | n. 


Proof. Let K = Q(G,). Along with the theorem statement, we will show that if n = p’m, 
ptm, then 


syclotomié-tactotination-1( D) = (I 3) ee (19.5) 


for distinct primes 58;. 
We induct on the number of prime factors of n. The case when n is a prime power is 
treated by Proposition |19.2.2| Suppose the theorem true for m and p {m; consider n = p™m. 


If n = 2 (mod 4), note Q(¢n) = Q(Gn 2): 
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Writing ¢,, = C™ and Gj, = CP", we consider 


Q(Cprm) (pOx)°?? = TI; pix 
PA <o(p") we —. 
Q(Gr) Qn] 
y(p") y(n) a, tle 
Q 


By Proposition 2), (p) = p?”") in Q\Gr], while by part 2, p splits into disctinct factors. 
Matching factorizations in Q[¢,-m], we get that each p;@x must be a perfect y(p")th power. 
Hence [Q(¢,,) : Q| = y(p"), and equality must hold. Then [Q(¢,) : Q) = y(p")y(m) = y(n) 
showing (1). 

Item (2) follows from Proposition{14]14.4.8]since by (3), disc(Q(G,r)/Q) and disc(Q(Gn)/Q) 
are relatively prime. Item (3) follows from Proposition [14]14.4.8] as well. The factorization 
comes from the fact that since [Q(Gprm) : Q(Gm)] = y(p") and each p; is the y(p")th power 
of an ideal, the degree equation says each p; must actually be the y(p")th power of a prime 
ideal. 


We now show a more precise version of (19.5), using Theorem |15.6.3 


Theorem 19.2.4: cyciotomic-factorization» Suppose that n = p’m, where p{m. Let 


} = ord, (p). 


Then the prime factorization of (p) in Q(¢,) is 


(p) = (Bi, -- BP” ) 


where ‘8, are distinct primes, each with residue degree f over Q, and g = ee), 


Proof. ({)* To use Theorem |15.6.3} we find the factorization of ©,(X) modulo p. We have 


eelaniencconarap sgt (X) = Il (X = ¢) = Il Il k (mod* p’) (X — Go: (19.6) 


j (mod* n) j (mod* m) 


Now note that . 
X-C.Ch =X —-C (mod Gr — 1). 


Hence (19.6) gives 


®,(X)= JL (X-G)°®0 =6,,(x)"?®) (mod Gr — 1). 


j (mod* m) 


For an alternate proof see Example |24]24.1.6 
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But both sides are in Z[X] so this congruence holds modulo (¢,- — 1) NZ = (p). 

Now consider ®,,(X) (mod p). Note that modulo p, P(X) := X™— 1 has no repeated 
factors since it is relatively prime to P’(X) = mX™~! 4 0; hence its divisor ®,,(X) has 
no repeated factors either. Note Fy consists exactly of elements with x?! = 1, any root 
a of ®,,(X) satisfies a” = 1 (but not a” = 1 for 0 < m’ < m). Thus the smallest field 
extension F,,, containing a is hence the smallest r such that m | p" — 1, i.e. r = ord,,(p). 
The irreducible factors of ®,,(X) have degree f, so f is the residue degree. The number of 


factors equals eee), and this is the number of distinct prime divisors of (p). 


3 Subfields of cyclotomic extensions 
Proposition 19.3.1: gatoiscyciotomic The Galois group of Q(¢,)/Q is 


G(Q(6n)/Q) = (Z/nZ)”. 


Proof. The conjugates of ¢, over Q are ¢* with k € (Z/nZ)*, the roots of &,. The Galois 
group acts transitively on the conjugates, so for every k € (Z/nZ)*, there is a automorphism 
o;, sending ¢, — ¢*, and these are all the automorphisms (look at the degree). Since ¢, 
generates Q(¢,,), the action of an automorphism on ¢,, determines it completely. It is clear 
that k + o, is an isomorphism (Z/nZ)* > G(Q(G,)/Q). 


Proposition 19.3.2: The unique quadratic extension of Q contained in Q(¢,) is 


Proof. By the fundamental theorem of Galois theory, a quadratic extension corresponds to 
a subgroup of index 2 in (Z/pZ)* ~ Z/(p —1)Z, and there is exactly one such subgroup. If 
it equals Q(V/d), then the only primes ramifying are those dividing d; since the only prime 
ramifying in Q(¢,) is p, we must have d = +p. 
To determine the sign, we explicitly find express a generator for Q(Vad) in terms of ¢,. 
Define 7 by the Gauss sum 
p-l 
r=2()¢ 


k=1 


An automorphism o € G(L/K) is described by o(¢,) = ¢2 for some 7; we have 


-E()e-E02) 


k=1 kas PP 


Cp 


so ot = T iff (4) = 1, which happens for exactly half the elements of G(L/K). Hence r 


indeed generates a quadratic field.® 


3This gives motivation for the Gauss sum appearing in the proof of quadratic reciprocity. 
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Now, if p= 1 (mod 4), we have (=) = | and we can pair (£) CE + (=) ¢o* ER, while 
if p= 3 (mod 4), we have (+) = —1 and (*) 3 - (=) Pu € Ri. This gives the sign of d. 
Alternatively, we can calculate 7 explicitly as in (BLAH). 


Proposition 19.3.3: For n > 2, Q(¢,,) is a CM-field with totally real subfield 


OG. +651) = @ (cos), 


Proof. 


4 Fermat’s last theorem: Regular primes 


Theorem 19.4.1: cyctotomicunits Any unit u € Z[¢,] can be written in the form 


u=Cky 


where v is totally positive, i.e. o(v) € R for any embedding a : Q|¢,] > C. 
Definition 19.4.2: A prime p is regular if p does not divide the class number of Z[¢,]. 


Theorem 19.4.3 (First case of Fermat’s last theorem for regular primes): Suppose that 
p> 2isa regular prime. Then any integer solution to 


aP + y? — zP 
satisfies p | ryz. 


Proof. For p = 3, note that any cube must be congruent to 0 or +1 modulo 9. Hence in 
order for x? + y? = z° (mod 9), one of x,y, z is divisible by 3, as needed. 
Now assume p > 3. By dividing by gcd(z, y, z) we may assume 2, y, z are relatively prime. 
Step 1: Factor the equation as 
p-l 
fermat-factored | | (@ + C2y) = 2”. (19.7) 


j=0 


(Note p is odd.) We show that if p { xyz, then the factors on the left are relatively prime. 
Take j # k and consider a := ged((a + Gy), ((x + Ghy))). We have 


a| (c+ Gy —ax— Cry) = (G- C)(y). 


Now z,y have no common factor in Z, so (#) and (y) have no common factor in Z[¢,], and 
(x + Gy) and (y) have no common factor. This shows 


a|(C=¢). 
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The RHS is prime, so either a = (¢/ — ¢*) = (1 — p) or a = (1). In the first case, we get 
(1 —p) | Tez9(a + ¢ly) = 2” so p| 2”, contradiction. 


Step 2: By uniqueness of ideal factorization, each factor of (19.7) is a perfect pth power. 
(a + Gy) = a} 


However, p { |C(Z[¢,])| so C(Z[¢,]) has no p-torsion. Since (x + (Jy) is a principal ideal, a, 
must also be a principal ideal (a;). By Theorem |19.4.1] we can write 


aj =Ciu;, vy € Q(GI. 


5 Exercises 


Problems 


1.1 Let p be a prime. Prove that any equiangular p-gon with rational side lengths is 
regular. 


1.2 (Komal) Prove that there exists a positive integer n so that any prime divisor of 2” —1 
is smaller that 27993 — 1, 


1.3 Find all rational p € [0, 1] such that cos pz is... 


(a) rational 


(b) the root of a quadratic polynomial with rational coefficients 


1.4 (China) Prove that there are no solutions to 2cospr = /n+1-— \/n for rational p 
rational and positive integer n. 


1.5 (TST 2007/3) Let 6 be an angle in the interval (0,7/2). Given that cos @ is irrational 
and that cos k@ and cos|(k + 1)6] are both rational for some positive integer k, show 
that 6 = 7/6. 


2.1 Show that the ring of integers in Q(cos 2) is Z[cos =*]. 


? Show that the class group of Q(93) (is this the right one?) is nontrivial. 
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Chapter 20 


Valuations and completions 


valuations-and-completions Here is some motivation for considering p-adic fields. 


1. One useful tool in arithmetic geometry is the local to global principle, which says that 
the existence of solutions modulo all primes tells us something about the existence of 
solutions in the original field or ring, such as Q or Z. For example, the Hasse-Minkowski 
Theorem. However, it is not enough to check for solutions modulo all powers of p — 
because a solution modulo p does not necessarily give a solution modulo powers of p. 
The solution is to look for solutions in a field which contains information modulo all 
powers of p, a p-adic field. 


2. When we take a p-adic fields, the only prime ideal remaining is p; all others primes 
become units. This vastly simplifies algebraic number theory; we don’t have to worry 
about primes that split. Then we can recover facts about the global field. 


1 Case study: p-adic integers 


padic-exs We first examine how p-adic rationals are defined, before generalizing to other number 
fields. 

Often we look at the integers modulo higher and higher powers of a prime p; for example, 
when we were looking at the existence of primitive roots (Theorem |4]4.6.2) or the structure 
of Z/p"Z (Theorem [4]4.7-1). Hensel’s lemma told us that under certain conditions we can 
lift solutions modulo higher and higher powers of p. 

Rather than work with powers of p piecemeal, we can devise a structure that holds 
information modulo all powers of p at once. To do this, we define the ring p-adic integers Z, 
and p-adic rationals Q,, which contain Z and Q, respectively. We will do this in two ways: 


1. Define Z, as an inverse limit of the rings Z/p"Z and Q, as the fraction field. 


2. Give Q a topology (or even better, a metric) related to divisibility by p, and complete 
Q with respect to this topology. 


185 


Number Theory, 820. 1 


1.1 p-adics as an inverse limit 


Definition 20.1.1: -adic-comp-seq A p-adic integer is a compatible sequence 


(Zn )n>1 


where x, € Z/p"Z and such that %n41 = £, (mod p”) for all n, ie. %p41 maps to x, under 
the projection map Z/p"t'Z > Z/p"Z. 

The ring structure is defined by componentwise addition and multiplication. The ring of 
p-adic integers is denoted by Z, and its fraction field is denoted by 


Q, = Frac(Z,). 
In light of Theorem |12}12.7.5| we can phrase this definition in a more abstract way: 


Lig, = lim Z/p"Z 


where there are maps y"" : Z/p"Z — Z/p"Z given by projection whenever m > n. 


1.2 p-adics as completions 


We can give define a topology on Z by decreeing that it be invariant under translation and 
that a neighborhood base of 0 be {p"Z,n > 0}. This is the same as the topology induced 
by the norm 


U 


b 
pt b,c. 


’ 
Cc 


lal, =p ”° when a= 


Definition 20.1.2 (Alternate definition of p-adics): Q, is the completion of Q with respect 
to the p-adic norm. 


We show the equivalence more generally in (). 


1.3. Units in Z, 
Proposition 20.1.3: The group of units in Z, is 


gx mw J ErXBl(P-VZ, pp #2 
. | Za x Z/22. x 2/22, p= 2. 


Proof. Note that 
Z = \im(Z/p"Z)* 


n>1 


because any inverse modulo p” can be lifted to an inverse modulo p"*!. 
The proposition follows from taking inverse limits in Theorem 
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1.4 Monsky’s Theorem* 


We use the 2-adic valuation to prove the following theorem from combinatorial geometry. 
Surprisingly, no proof is known that does not use p-adics. 


Theorem 20.1.4: A unit square cannot be cut into an odd number of triangles of equal 
area. 


The idea of the proof is as follows. 
1. Extend the 2-adic valuation to a nonarchimedean valuation on the real numbers. 


2. Color each point in the plane one of three colors, based on the 2-adic valuation of 
the coordinates. We show that the sides of the square only have two colors, with 
the vertices alternating colors, and that a triangle of area + where m is odd, cannot 
contain vertices of al three colors. The last facts depends crucially on the fact that the 
area formula for a triangle has a factor of 5 in it. 


3. By Sperner’s Lemma (from graph theory), the coloring in such a subdivision is incon- 
sistent. 


Proof. We postpone the proof of the first item.! Assuming it, color the points of the plane 
in three colors depending on which of the following conditions is satisfied. 


(A) |zl2<1,|yle<1 
(B) |zlo > 1,|zl2e > lyle 
(C) |yl2 = 1, |yl2 > |xl2 


First, we show that if (Aw, Ay) has color A, then translating by (Az, Ay) does not change 
the color of A. Indeed, consider 3 cases. 


1. (x,y) is of color A. By the nonarchimedean property, we have 
Jz + Azz < max(|2|2,|Azl2) < 1, ly + Ayl2 < max(|yl2, |Ayl2) < 1, 
so (x + Az, y + Ay) is again of color A. 
2. (x,y) is of color B. Since |z|2 > 1> |Az|2, we gave 
jz + Aga = |z|2 > 1. 
Since |x|2 > |ylz and 1 > |A,|2 we have 
ly + Ayl2 < max((yl2, Ayl2) < |el2 = |e + Ace. 


Hence (« + Az,y + Ay) is again of color B. 


'There is a way around it; see Proofs from the Book. 
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3. (x,y) is of color C. The proof is the same as above except x,y are interchanged and 
there is strict inequality in the dotted inequalities above. 


Now suppose that A, B,C are three points of those respective colors. By translation we may 
assume that A = O. Let B = (x,y) and C = (2’,y’). We have 


Rap = ly|2 
ly'|2 > |rlo 
=> |xy'l2 > |2"yle. 


1. A,B,C cannot be collinear, as that would imply xy’! = z'y. 


2. We show A, B,C cannot form a triangle of area at for m odd. The area is +3(ty'—z2'y), 
and we have 


1 / / 1 / 

=(zy/ — = |= $a 

j5(0y' — 2’ = [5 Ielely'le > 1, 

while FA al 

Next we establish the following combinatorial lemma. 
Lemma 20.1.5 (Sperner’s lemma): Suppose P is a polygon that has been subdivided into 
triangles. Define a verter or segment to be a vertex or edge of one of these triangles, and 
say a segment is of type C,C2 if the endpoints are colored C; and Cj. We say a triangle is 
rainbow if it has vertices of all 3 colors. 


Suppose every vertex of the subdivision is colored with either A, B, or C, such that the 
following hold. 


1. No outer edge of P contains vertices of all three colors. 

2. There are are an odd number of segments of type AB on the outer edges. 
Then P contains a triangle whose vertices are all different colors. 
Proof. We count the number of segments of type AB. In a monochromatic triangle the count 
is 0, in a two-colored triangle the count is 0 or 2, and in a three-colored triangle the count 
is 1. Let n be the sum of the counts over all triangle. Every interior segment of type AB is 
counted twice, as it is part of two triangles, so 

n=2i+e, 

where 7 and e denote the number of interior and exterior segments of type AB. Since e 


is odd by assumption, n is also odd. But this can only happen if there is a three-colored 
triangle. 
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Now the points O = (0,0), X = (1,0), Y = (1,1), and Z = (0,1) are colored with A, 
B, B, C, respectively. We’ve shown that each side contains segments of at most 2 colors; 
segments of type AB can only appear on side OX and XY; in the former there must be an 
odd number (since O, X are different colors) and in the latter there must be an even number. 
Thus the conditions of Sperner’s Lemma are satisfied, and any subdivision must contain a 
rainbow triangle, which cannot have area at for m odd. 


2 Valuations 


Definition 20.2.1: A valuation on a field K is a function |-|: AK — R such that 


1. |x| > 0 with equality only when x = 0. 
2. |xy| = |2|lyl- 
3. |e t+y| < [x] + [yl 
If the stronger condition |x + y| < max(|z], |y|) holds, then | - | is nonarchimedean. 


Example 20.2.2: For a number field AK, any embedding 0 : K <— C gives a valuation on 
K: 
|a| := |oal. 


Example 20.2.3: -acdicvai The p-adic valuation is 


1 Up (a) 
b= (55) 


In the special case K = Q, p = (p), we have 


1 Up (a) 
a= (=) | 
’ \p 


Proposition 20.2.4: nonarchcrit A valuation is nonarchimedean if and only if it is bounded 
on Z. Hence if char(/’) 4 0, then Kk only has nonarchimedean valuations. 


Proof. If |-| is archimedean, then |1+---+ 1] < |1] =1, so |n| < 1 for any n € Z. 
Conversely, suppose that |-| is bounded on Z, say by C. We have 


(") akpr-* 
kao \K 


< Ye Clal*a* 
k=0 
< C(n + 1) max(|al, |b|)”. 


(a+ b)"| = 


Hence for all n > 1, ja +b] < (C(n +1))* max(|al, |b]). Taking n > 00 gives the result. 
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Proposition 20.2.5 (Relationship between additive and multiplicative valuations): Fix a 
base b. There is a correspondence between additive and multiplicative valuations, given by 


jel =o" 
v0) = —log,(2. 


Different values of b give equivalent valuations. If v(/*) is discrete in R, then it is a multiple 
of a discrete valuation. 


We say |-| is discrete when |k*| is a discrete subgroup of Ryo. 
Using the above correspondence, we find 


1. A:={ae€K: |a| < 1} is a subring of K, with 
2. U :={ae K : |a| = 1} as its group of units, and 
3. m:= {a € K : |a| < 1} as its unique maximal ideal. 
The valuation is discrete if and only if m is principal; then A is a DVR. 
Proposition 20.2.6 (Elementary properties of discrete valuations): ciem-prop-av 
1. |a +6] < max(|al, |b|) with equality if |a| F |b). 


2. (“All triangles are isosceles.” ) If d(c,b) < d(c,a) then d(a,c) = d(a,b). (The longer 
side is the repeated one.) 


3. If a; +--- +a, = 0, then the maximum valuation of the summands must be attained 


for at least two of them. 


2.1 Equivalent valuations 
A valuation on K defines a metric (and hence a topology) on Kk by 
d(a,b) = ja — 0]. 


For example, high powers of p have small p-adic valuation, so numbers differing by high 
powers of p are close together in the p-adic valuation. 


Proposition 20.2.7: vat-equiv-crit Let |-|1, |-|2 be valuations on A’, with the first being nontrivial. 
Then the following are equivalent. 


1. |-|1,]-|2 determine the same topology on K. 
2. If jal; < 1, then |alz < 1. 


3. |- |1 =|- | for some a > 0. 
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We say that |- |, and | - |, are equivalent if the above conditions hold. 


Proof. 
(1) => (2): Note |a|; < 1 if and only if |a"|; = |a|? — 0, ie. a” converges to 0 in the 
topology of |-|;. Since the topologies are the same, 


al; <1 <=> a” converges to0 <= > |als < 1. 


(2) = > (8): Take y so that |y|; > 1, and let a = tute, so that |y|2 = |y|{. We show that 
|z|2 = |a|f for alla € K. 

Suppose |z|, = |y|?! and |a|2 = |y|%. We need to show b; = by, ie. so the following 
commutes. 


Na 
(ee P 
|x| > || 


“a ] “nl 


ap st ly|2 


We approximate 6; with rational numbers “. First suppose b; > %. Then 


=) = Wino <1 
vl 
so by hypothesis 
m—bon __ y™ 
=|—| <1 
ig = || 


giving by > ™. Similarly, if b} < ™, then the above argument with a shows by < ™. Since 
Q is dense in R, we have b; = by. 


(3) == (1): The open ball of radius r with respect to | - |; is the same as the open ball of 
radius r* with respect to | - |». 


3 Places 


Definition 20.3.1: A place is an equivalence class of nontrivial valuations on K.? We 
denote by Vx the set of places of K, by V2 the set of nonarchimedean places and V° the 
set of archimedean places. 


We aim to classify all places in a number field K. 


Proposition 20.3.2: nonarch-vas Let A /Q be an algebraic extension. Then the places on 
are exactly the p-adic valuations | - |, for p a prime ideal of Ox. 


?Some books use “prime” instead of “place.” We use the latter term to avoid confusion. 
p 
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Proof. Since K is algebraic over Q, an element a € @x satisfies a monic polynomial equation 
with coefficients in Z: 
gn dee ae — 0) 


By Proposition |20.2.4| a; € Z gives |a;| < 1. By the nonarchimedean property, 


lo|” = |an-1a™ 1 +++» + a9) < max |am||lv"| < max |a|™. 
0<m<n-1 0<m<n-1 
Hence |a| < 1. 

Let B be the ring of integers of |- | and m its maximal ideal. Since m is prime in B, 
p:= mf A is prime in A. Note p ¥ (0) because if so | - | is trivial. 

Now suppose vp(y) = n. Let 7 € p\p? be a uniformizer. Then (yz~") is a fractional 
ideal; suppose ideals p1,..., Pm appear in its factorization with exponents at least —k. Take 
b € (\_,p?. Then (yb) is an integral ideal (c) not divisible by p. We have c € A\p. 
Writing |7| = (4)’, we have 


n 
Cc 


ly| = 5 


e 1 
a= | 


Np 


= lylp- 


Moreover, two equivalent nonarchimedean valuations would have the same maximal ideals 
and hence correspond to the same prime p. 


Theorem 20.3.3 (Ostrowski): ostrowsii The following is a list of all places on Q. 
1. Archimedean: | - |... 
2. Nonarchimedean: | -|,, where p ranges over all primes. 


Proof. Let |-| be a valuation on Q and m,n be integers greater than 1. To compare |m| and 
\n|, we write m in base n: 


m=a,n" +---+a9, O<a,<n-—1l,a,>0. 
Let N = max{l1,|n|}. Then by the triangle inequality, 
|m| < So a,N*. 
k=0 
Since r < am we get 


1 1 nm nm 
mi|< + —+—4+--- n un < On me 
<(1 nwt 7p N 


3A stronger version of part 1 is as follows. Let K be complete with respect to an archmimedean norm. 
Then Kk = R or C, and the norm is the normal absolute value raised to a power in (0, 1]. 
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Replacing m by m* and taking the tth root gives 
|m| < (2n)* Nimr 


Taking t > oo gives 
Inm 


m| < Ninn, (20.1) 


ostrowskil 


Consider two cases. 


1. For all integers n> 1, |n| > 1. Then (20.1) gives |m|mm < |n|m=. By symmetry, we 
get |m|mm = |n|m. Since this is true for all m and n, |n|m= = c is constant, ice. 


for all n € Z. Since Z generates Q as a group, we get that |-| is equivalent to the 


standard archimedean valuation. 


2. For some n > 1, , <1. Then shows that |m| < 1 for all m > 1. Thus by 
Proposition 4} | - | is ae The nonarchimedean valuations are given 
by Proposition 70.3.2 


Later on we will return to the question of finding all valuations on an extension of Q 
(Theorem ??). 
Generalize the nonarchimedean stuff to number fields. 


3.1 Approximation 


Theorem 20.3.4 (Weak approximation theorem):  tim:weat-approx Let V1,..., Un be all the places 
of K, with valuations |-|1,...,|+|n,. The map 
N 
6:K 3] Ky, 
j=l 


induced by the inclusions kK > K,, has dense image. 
In other words, given a,,...,a, € K, for any ¢ > 0, there exists a € K such that 


la — a;|; < € for all j. 
Proof. Step 1: We show that there exists a such that 


weak-approx-1 lal, > ‘lls (20.2) 
lal;< 1, ¢=2,...,n. 


We induct on n. For n = 2, note that by Proposition |20.2.7(2), we can find b,c so that 
ola <1, Jol2 21 


Icla > 1, Iclo < I; 
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Now take a = Fe 


For the induction step, suppose we’ve found b so that (20.2) holds for n — 1. Choose c so 
that 
lela, ely <d: 


we will use it to “correct” |b|,, as necessary. Consider three cases. 
1. |b], < 1: We can let a = b. 
2. |b|, = 1: Let a = b’c, for large enough r. This works because 
(50, j=l 


255 SH 1 
UICln <1. (=, 


lim |b"e|; = 40 


o] 


3. |bl, > 1: First note that from 1—|a"| < |1+a"| < 1+ |a’| we get 


r 


é Xx 
weak-approx-expr lim 
r—oo | 1 + x" 


( 

0 =I 

_)% Fl (20.3) 
E,. “ee| Sed, 


Let a = or for large enough r. This works because the above gives 


(\ci>1, j= 
7 tlen <1, jan. 


Step 2: Now we show that there are points in the image of ¢ arbitrarily close to (1,0,...,0). 
Indeed, choosing a as in step 1, we have by (20.3) that 


a" 
lim ( = 10.9500): 
ae l+ar ( ) 


Step 3: From step 2, choose b; sufficiently close to (0,... :0, 1, 0,...,0). Let 
J 


n 
a= S- Andn 
j=l 


to find y(a) can be arbitrarily close to (a1,...,@n). 


Note that if we include only the finite places, then this follows from the Chinese remainder 
theorem. 
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4 Completion 


Definition 20.4.1: Let A be a field with valuation | -|. The completion of /’, denoted K 
is the field containing K (i.e. there is a injection K < K preserving valuation) satisfying 
the following properties. 


1. K is complete in its topology. 


2. (UMP) For any homomorphism y from K’ to a complete field L, there exists a unique 
homomorphism K — L making the following commute. 


17 
k 
Le., K is the smallest complete field containing K. 


Proof of existence. For existence, let K be the set of equivalence classes of Cauchy sequences 
in K, and deem two sequences {a,,} and {b,,} equivalent if limp. lan — bn| = 0. Define 
Rak by sending a to (a,a,...). Extend the valuation by letting defining the norm of a 
{an} to be limy-s0 |@n|. See any book on real analysis for the details. 

For the second part, given a sequence {a,,} € K, map it to lim, 5. yp(an) € L. Uniqueness 
follows from the universal property. 


4.1 Completions of archimedean fields 


Theorem 20.4.2 (Ostrowski): ostrowski2vatson-nmber-teids The only complete archimedean fields, 
up to isomorphism of valued fields and equivalence of valuation, are R and C. 


Proof. See Neukirch, p. 124. 


We can now finish our classification of places on k/Q. 


Theorem 20.4.3 (Classification of places of A’): Let A be a number field. There is exactly 
one place of K for each 


1. prime ideal p, 
2. real imbedding, and 


3. conjugate pair of complex embeddings. 


The valuations corresponding to prime ideals, i.e. p-adic valuations, are called finite 
places, while the those corresponding to real and complex embeddings are called infinite 
(real or complex) places. 
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Proof. The nonarchimedean valuations of K are given by Proposition ??, while each archimedean 
valuation v corresponds to an embedding (respecting valuations) 


Ko k,=RorC, 


the isomorphism coming from Theorem |20.4.2| Note that complex conjugate embeddings 
give the same valuation. 


Corollary 20.4.4: Let L/K be extensions of number fields. If v is a place corresponding 
to a prime p of K, then the places w | v in L correspond to primes $$ | p. If v is a place of 
K corresponding to an embedding o : K — R or C, then the places w | v correspond to of 
ao to L. 


4.2 Completions of nonarchimedean fields 


Suppose Kt is a field with a discrete nonarchimedean valuation | - |. Let 7 be a local uni- 
formizing parameter, i.e. the largest element of K with |z| < 1. Equivalently, 7 generates 
the maximal ideal m in the subring of z-integers. 

Since K is dense in K and 


JA \{O}] = {|a]":m € Z} 
is discrete in K, we get |K| = |A\. 


Proposition 20.4.5: piadic-expansion Let S be a set of representatives for A/m. Then every 
element of K has a unique expression in the form 


S- Ont 


n>N 


More precisely, the sum represents limyn_.55 0)". Gn7”.) The norm is given by 
n=N 


= ae an = 0. 


S- Ant” 


n>N 


In other words, we can write elements of K as “numbers with infinite 7-expansions going 
off to the left,” as we saw in section 


Proof. Let {8n}n>1 be a Cauchy sequence in kK’. Let 


a = S- An(m)n4; 


m>—oo 
where a,(m) € S; this sum is finite. We have 


[iis oo Sno | =p min{ man (m) Aang (m) } 
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Hence for each m, a,(m) eventually stabilizes, say at a,. Then 


im. Sn = S- AnT 


n>>—oo 


Thus we have two ways to think of a p-adic valuation. 


Proposition 20.4.6: 
k= Frac(lim A/m"). 


To connect up the analytic and algebraic definitions of the completion, note that the 
completion of a ring A with respect to an ideal m is defined as A = lim A/m”, with the 
topology given by a neighborhood base at 0 being {m”},,>0. 7 


Definition 20.4.7: Define the exponential function as a power series 


We investigate the convergence of e*. Writing a = a,p" +---+ do in base p, we find by 


Example that 


ord,(n!) = 


Hence 


ord, (=) =nord,(2) = mos = 
n! p- 


Since e” converges if and only if ord, (= +) — —oo, we get the following. 


Proposition 20.4.8: e* converges for ord,(x) > ae 


5 Hensel’s lemma 


The following is the first version of Hensel’s lemma for z-adics. COMPARE TO ELEMEN- 
TARY STATEMENT IN TERMS OF MODS. 


Lemma 20.5.1 (Hensel’s lemma, I): tense Let f(X) € A[X], and ap be a simple root of 
f(X) modulo 7, i.e. f(a9) = 0 (mod 7) and f’(a9) #0 (mod 7). Then there exists a unique 
root a of f(X) with a = ao (mod 7). 


Note this can be generalized as follows: Suppose f(ag) = 0 (mod 7”) and v,(f’(ao)) = 
k <n. Then there is a unique root a of f(X) with a = aj (mod 7”*). The proof is the 
same, and is left to the reader! 


197 


Number Theory, §20.5 


Proof. We find zeros of f(X) modulo higher and higher powers of 7. 
Using induction, we find a, satisfying 


f(a@n) =0 (mod a”), 


The base case holds by hypothesis. For the induction step, note that by Taylor expansion 
of polynomials, 


f(@n + ha"**) = f (Qn) + ha” f'(Gn) eee 
= f(a,) + ha™*' f'(a,) (mod 2”**), 
Since f’(an) # 0 (mod 7) and f(a,) = 0 (mod z”*'), we can choose h so that this is 0 
modulo 2"*!. (Explicitly, h = a) . Fay’) We let dn41 = Gn + hat. By construction, 


the sequence a, converges; let a be its limit. Since a = a, (mod 7”), we get f(a) = f(an) =0 
(mod r”*") for all n, and therefore f(a) = 0. 


The first form of Hensel’s lemma tells us about lifting a root ao of f (f modulo 7) to a 
root a of f in K. We can think of this as lifting a linear factor x — ao of f to a linear factor 
x —a of f. A stronger form of Hensel’s lemma says that we can in fact lift any factor of f 
to one of f. 


Theorem 20.5.2 (Hensel’s lemma, II): nensei2 Let & be the residue field of A and f be a 
monic polynomial. If f = goho where go and ho are monic and relatively prime, then f = gh 
for some g and h such that g = go and h = ho. (uniqueness) 


If f = g1--- Gn is the complete factorization of f in k[X], then the complete factorization 
of f in K[X] is f = fi--- fn where f; = gj. 


Proof. First we need the following lemma, which tells us that if the reductions of polynomials 
are relatively prime, then so are the original polynomials. 


Lemma 20.5.3: Let A be a local ring with residue field k. If g,h € A[X] are such that 
g and h are relatively prime, then g and h are relatively prime in A[X] and there exist 
polynomials u,v with degu < degh and degv < deg g such that 


ugt+vh=1. 


Proof. Since g and h are relatively prime in k[X] = (A/m)[X], (g,h) = A[X]/mA[X] and 
(g,h) + mA[X] = A[X]. Since A[X]/mA[X] is finitely generated (on account of g,h being 
monic), by Nakayama’s Lemma (g,h) = A[X]. We can choose u,v such that ug + vh = 1; 
drop all terms with higher degree. 


We proceed as in the proof of Theorem |20.5.1| Suppose we have found g, and h, such 
that 
f= anh, (mod 7"). 
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We have 


(gn tun") (hy + unt) = gnhn + (Ugn + vhy)n™** (mod 2”*?), 


By the lemma we can choose u and v such that the above is congruent to f modulo 7"*?. 


Again let gn41 = Qn tum"), Anyi = hy + un"™!, and take the limit as n > oo. 
The second part follows from induction. Note f = f,--- f, is the complete factorization 
because any factorization of f gives a factorization for f. 


Definition 20.5.4: A henselian field is a field with nonarchimedian valuation v which 
satisfies Hensel’s Lemma (with p the maximal ideal corresponding to v). 


Hensel’s lemma says that a field that is complete with respect to a discrete valuation is 
henselian. 


6 Extending valuations 


Theorem 20.6.1 (Extending discrete valuations): extena-aiscrete-vamations Let AY be henselian and 
let L/K be finite separable of degree n. Then |- |x extends uniquely to a discrete valuation 
|- |, on L, given by 


[Blt = | Nmy/x Blk. 


Proof. Neukirch, pg. 131-132. 


Definition 20.6.2: Let K be henselian. Let ord : K* — Z be the corresponding additive 
valuation, extended to K*!* — Q. Given a polynomial 


f(X) =X" +a, 1X" | +--+ a9 € K[X] 
define the Newton polygon of f(X) to be the lower convex hull* of 
P, = (0rd (a). 


Proposition 20.6.3: Suppose the bottom of the Newton polygon has segments of x-length 
n; and slope —s;. Then 


1. f(X) has exactly n; roots a € K® with ord(a) = s;, and 
2. fi(X) = Tloraa;)=s;(X — ai) has coefficients in K. 


Proof. We prove the following statement by induction: if f(X) = [](X —a;) € K[X] and 
exactly n; of the roots a; have order equal to s;, then the Newton’s polygon of f(X) has a 
segment of slope —s; and x-length n,. 


‘draw the convex hull, and remove the segments joining (0, ord ag) and (n,0) from the top 
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The case n = 1 follows since the only line segment on the bottom joins (0, ord(a@;)) and 
(1,0). Now suppose the claim proved for n. Consider 


n+l 


g(X) = (X — a) f(X) = Yo (aur — acy.) X" 


k=0 
(where nonexistent coefficients are set to 0). Let t = ord(a). Let kg be the point such that 


the slopes of the line segments of Newton’s polygon N for k < kg are s < —t, and such that 
the slopes of the line segments of N for k > ko are greater than s > —t. Let 


dy, = ord(ax) 
¢;, = y-value of intersection of N with x =k 


d,, = ord(az—1 — aax) 


ee) ee ee ee 


Z ion 0<kh< hy 
Let N’ be the broken line formed by joining (k, ¢,). N’ consists of segments of the same 
slopes as NV, plus one more segment of slope —t and x-length 1, in increasing order. It suffices 
to show that N’ is the lower convex hull of the points (k, d;,). 

Here is an example with p = 5, f(X) = (X —5)(X —10)(X —15)(X —125) and a = 25.° 


Consider 2 cases. We will use 
d,, = ord(ax_1 — aay) > min(ord(ag_1), ord(aa;,)) = min(d,_1, d, + t), 
with equality holding if d,_, 4 d, +t. 
1. k < ko: We have 


(*) 
dp_-1 > Cp-1 > Ce +t = G,, 
djtt>&+t=&, 


°Of course, f does not have to split over Q[X] and the valuations don’t have to be integers. 
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7 


where in (*) we use the fact that the slope of the segment (k — 1, €,-1)(k, @,) is at most 
—t. Hence (k, di.) lies above N’. Now suppose (k, d;) lies on a corner of L (excluding 
k = ko). Then d, = & and inequality holds in (*): 

dy >h+t=l=d,+t 


so dj, = ¢;, and (k,dj,) lies on N’. 


. k > ko: We have 


pre ha—s, 


ae oe rer ae =. 
where in (*) we use the fact that the slope of the segment (k — 1, ,_1)(k, &,) is greater 
than —t. Hence (k, di.) lies above (k, ¢,,). Now suppose (k — 1, dy_1) lies on a corner of 


L. Then dy—1 = ln—-1 sO 


dptt>pt+t>dp1=le-1=%, 


showing dj, = ¢,, and (k,di,) lies on N’. 


Places as Galois orbits 


Here is an alternate definition of a place. 


Definition 20.7.1: Let (K,v) be a field with valuation and L/K be an extension. A place 
on L over v is a G(K,,/K,)-orbit on Hom (L, K,). 


Example 20.7.2: Let K = R, and L a fiinite extension of K. Then the places of L 


over 


G(C/ 


R are just Hom,(L,R), the real embeddings of L, and the complex places are just 


R)\ Hom, (L, C), i.e. pairs of complex conjugate embeddings. 


We show this is equivalent to our previous definition. 


Theorem 20.7.3: Assume... There is a bijective correspondence between equivalence classes 
of valuations w | v, v on K, and G(K,,/K,)-orbits on Homx(L, K,): 


{w]lv:weé M;} > G(K,/K,)\ Home (L, K,). 


Letting 0 be the unique extension of v to K,, the embedding 7 : L @ K, is associated 
to the valuation | - |; restricted to L. 
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8 Krasner’s lemma and consequences 


The following is a surprising result... Krasner’s lemma 


Lemma 20.8.1 (Krasner’s lemma): icasner Let AK be complete with respect to a nonar- 
chimedean valuation | - |, and extend |-| to an algebraic closure K*!. Let a,8 € K. 
If 6 is separable over K’[a], and 


B=oe|<|f =| (20.4) 


for any conjugate 3’ # 6 of 6 over K, then 6 € K{al. 


belong-ineq 


We say that a belongs to G if inequality (20.4) holds. 


Proof. By the fixed field theorem, it suffices to show that for all embeddings o : K(a, 8) 9 
k*' fixing K(a), that o(G) = 8. We have 


|o(8) — a] = |o(8) — o(a)| = |B — al 


since |e | = |ae| and o(a) =a. Hence 


the last following since |-| is nonarchimedean. By the minimality assumption we must have 


o(B) = 8. 


We define a norm on polynomials by setting 


n 
yaax'| = max [cl 
= 0<k<n 


Using Krasner’s Lemma, we show that polynomials that are close together have roots that 
are closely related. 


Proof. Choose 6 so this last quantity is at most minjz,;|a; — a;|. Then by Krasner’s 
Lemma |20.8.1] a € K[G]. Since 6 and a both have degree n over K, K(a) = K({). 


In fact, we have the following stronger result. Using Krasner’s Lemma, we show that 
polynomials that are close together have roots generating the same extensions. 


Theorem 20.8.2: tcasner-pory Given f, there exists « > 0 such that if ||f—gl|| < ¢, then 
there is an ordering of roots a1,...,Q@, and (1,...,@8n of f and g, respectively, counting 
multiplicities, such that K(a;) = K(;). 

Proof. Step 1: First we show that the roots of g approach the roots of f, as || — g|| > 0. 
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Lemma 20.8.3: roct-continuityi Keep the hypothesis of the theorem. Suppose ¢« > 0. Then 
there exists 6 > 0 such that if || f — g|| < 6, then for every root (6 of g, there exists a root a 
of f such that |8 — al <e. 


Proof. First note that the roots of a monic polynomial h are bounded in terms of ||Al]. 
Indeed, letting h(X) = Vip ceX*, if 7 is a root of h, then by Proposition |20.2.6(3), we 
must have cyy* > y" for some 0 < k <n, and hence 


1 


4 <efF < max(1, |All). 


Suppose || f — g|| <6 is small (say, less than 1). Then ||g|| < || f|| +6, which is bounded. 
Hence the roots of ||g|| are bounded, say by C. Let @ be a root of g. On the one hand, we 
have 

fanction-continnity( f —_ g) (3B) < If — gl max{|G|", 1} < dmax{C”, 1} (20.5) 


and on the other, 


(f — 9)(8) = (8) = TI (6 — an). 


k=1 


Hence |@ — ax| < (5max{C”, 1}) for some n. We can choose 6 so this is less than e. 


Step 2: We strengthen the lemma to account for multiplicities. 


Lemma 20.8.4: sovt-continuitys Keep the hypotheses of the theorem. For every ¢ > 0 there 
exists 6 > 0 such that whenever || f — g|| < 6, there exist orderings a1,...,Q@, and (1,...,6n 
such that |G, — a,z| < € for all k. 


Proof. By Lemma |20.8.3} as || f —g|| — 0, the distance from the roots of g to the closest 
roots of f approaches 0. Let 6,(g),...,Gn(g) be the roots of g. For each k let a,(g) be 
the root of f closest to G,(g). We have max, |3.(g) — axz(g)| > 0 as g > f. Suppose the 


distinct roots a/,...,a/,, of f have multiplicities 71,..., 7, and suppose that they occur with 
multiplicities $1,..., Sm in the a,(g). Suppose by way of contradiction that (51(g),..-,Sm(g)) 
is not constantly (r1,...,7m) for g close enough to f. Then we can find a sequence g; > f 


such that (s1(g;),.--,5m(g,;)) is constant and not equal to (r1,..., 7m). Then 


ai(X) = TE (X — Belo) > TX —04(9)) & TEX — 04) = F000), 


> 
ll 
mn 
> 
ll 
an 
> 
ll 
mn 


contradiction. 


Step 3: Take ¢ = minjz; |a; — a’,| in Lemma|20.8.4| Then Krasner’s Lemma|20.8.1] gives the 


conclusion. 


From this we get that every field extension of Q, can be described by a field extension 
of Q, by choosing a close enough approximation to a minimal polynomial. 
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Corollary 20.8.5: Let L/Q, be a finite extension. Then there is a finite extension K/Q 
such that [K : Q| = [L:Q] =nand K-Q,=L. 


Proof. Using the primitive element theorem, choose a so that Q,(a) = L. Let g € Q,[X] be 
the minimal polynomial of a. By Theorem }20.8.2| for g close enough to f, there is a root 6 
of g such that Q,(a) = Q,(6). Take g € Q[X] sufficiently close, and L = K(f). Then 


K -Q, = K(a) = K(8)=L. 
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Chapter 21 


Local and global fields 


lg-fields 


1 Topology of local fields 


Definition 21.1.1: A local field is a field K with a nontrivial valuation |-| such that K 
is locally compact. 


Note this requires that K is complete. 


Proposition 21.1.2: sing otintegers-compact Let K be complete with respect to a discrete nonar- 
chimedean valuation. Then A is compact if and only if k := A/m is finite. 


Proof. Suppose A is compact. Note m = {x: |x| < 1} is open, and any translate of it is 
open. Note A = Lac4/m@ +m where the union is over representatives in A/m. A finite 
number of these cover A, so k is finite. 

Conversely, suppose k := A/m is bounded. It suffices to show that A is closed and totally 
bounded!. 


1. A is closed since A = {x: |x| < |x|}. 


2. A is totally bounded: Given ¢ > 0, choose r so that ||"tt < ¢. Now every element 
is in a ball of radius 1 centered at one of the finite number of points in the form 
ao tayn +---+a,7". 


Proposition 21.1.3: compact-setsinir If A’ has finite residue field then Gf, p", and 1+ p” are 
all compact. 


Proof. From Proposition |21.1.2, A is compact. The above are all closed subsets of A so 
compact. 


TA set is totally bounded if for every r, A can be covered by a finite number of sets with diameter at most 


205 


Number Theory, §21.2 


Theorem 21.1.4: The following is a complete classification of local fields, up to isomor- 
phism. 


1. R and C with the usual metric. 


2. Finite extensions of Q,. 


3. Field of formal Laurent series k((T’)) over finite field. 


Proof. Neukirch, p. 135. 


Fill in: local fields vastly simplify things because... 


1.1 Open sets and continuity 


Proposition 21.1.5: poweropen For any local field K and any n, the nth power map is open 
on K™, i.e. it takes open subsets of K™ to open sets. 


Proof. For K = R or C, this is clear. 

For K a-adic, this is an easy consequence of Hensel’s Lemma. Let y © K*”". We may 
suppose u(y) = 0. Suppose x} —y = 0. Let k = v(p) and let € be such that v(e—y) > 2k+1. 
Consider the polynomial f(x) = 2” — y. Now f(xo) =0 (mod 7?**+) so by Hensel’s Lemma 
xo lifts to a solution of f in kK. (The version of Hensel in ACIM, p. 14. Add this in.) 


Proposition 21.1.6: pr:nmcon For any extension of local fields L/K, any 0 € G(L/K) acts 
as a homeomorphism, and the norm map Nmz,x is continuous and open on K™. 


2 Unramified extensions 


Definition 21.2.1: Let kK be a complete field with residue field k; let L be a finite extension 
of K with residue field 1. We say L/K is unramified if 1/k is separable and the prime ideal 
p in Ox does not ramify in L. 

L/K is totally ramified if p ramifies completely; by the degree equation this is equivalent 
to l= kh, 


Note from the residue equation that 
doesnt-ramify) does not ramify ——s [L H iq ae [ js kj. (21.1) 


Our main theorem of this section is Theorem We will show that if L/K is 
unramified, then //k is separable. If 1/k is separable, though, we need an extra condition to 
make sure L/K is unramified; namely that a minimal polynomial for L/K stays a minimal 
polynomial for //k, so that holds. 
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Proposition 21.2.2: complete-unramified-criteria Let K be a complete field with residue field Ke let 
L be a finite extension of K with residue field 1. Suppose L = K(qa), and let g(x) € K [a]. 
The following are equivalent. 


1. L/K is unramified, and g is the minimal polynomial of a. 


2. |/k is separable, with | = k(@), g has a as a root, g is the minimal polynomial of @, 
and g has no repeated roots. 


Proof. Suppose (1) holds. Then g has @ as a root. Note L = K(a) gives! = k(@). By (21.1), 
@ has degree [I : k] = [ZL : K] over k. Since g has degree [L : K], it must be the minimal 
polynomial of @, and have no repeated roots. This shows |/k is separable. 

Suppose (2) holds. We have 


[L: kK] <degg=degg=[l: kl, 
the last equality following since 7 is the minimal polynomial of @ But [LZ : K] > [Il : kj, 


so equality holds and p (the prime ideal of @x) is unramified by (21.1). Thus L/K is 
unramified. 


For local fields, the property of being unramified behaves well under extensions and 
products. 


Proposition 2 1 . 2 3 unramified-props 


1. Suppose that kK C L C M are finite extensions. If W/L and L/K are unramified, then 
M/K is unramified. 


2. Suppose that K C L,M are finite extensions. If L/K is unramified, then LM/M is 
unramified. 


3. Suppose that kK C L,M are finite extensions. If L/K and M/K are unramified, then 
DLM/K is unramified. 


M LM LM 
L L | unram M 


L M 
ik i 


Proof. Let k,l,m,n be the residue fields of K,L,M,IM, and p, 38, and 8 be the prime 
ideals of Gx, GL, Gx, respectively. 


1. We have p@y = BO, =’. Separability is transitive, so M/K is unramified. 
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2. Write L = K(a). By Proposition we can find g with a as root such that g 
is the minimal polynomial of 1 = k(a) over k, and is separable. Then the minimal 
polynomial for n = m(a) over m divides g, hence is separable. By Proposition [21.2.2] 
again, LM/M is unramified. 


3. By part 2, LM/M is unramified. Since M/K is unramified, by part 1 LM/K is 
unramified. 


Theorem 21.2.4: unramified-is-separable Let K be a field; fix an algebraic closure. There is an 
equivalence of categories between 


e finite unramified extensions L/K, and 
e finite separable extensions |/k. 

Ly ————_> Ln 

| | 

by = L1/p1 — lz = L2/po. 

Moreover, 
1. L CM if and only if 1 Cm. 
2. The residue field of LM is Im. 
3. L/K is Galois if and only if //k is Galois, and 


G(L/K) = G(1/k) 
by restricting 0 € G(L/K) to B = G@; and modding out by BB. 


Proof. By Proposition [21.2.2] L does get sent to a separable extension. 

First we show the map is surjective. Given //k separable, choose { so that | = k() and 
choose f so that f be the minimal polynomial of 8. Since # is a simple root of f, by Hensel’s 
Lemma [20.5.1] we can lift it to a root a of f. Then K(a) is mapped to k((). 

Part (2) is clear. For (1), if L C M then clearly | C m. Conversely, suppose 1 C m. Now 
LM is also unramified (Proposition and has residue field /-m =m. Hence, 


[Aa = | = lek) = [ns Fk) = |b |, 


showing L C M. 

If / = m, then the above shows that L = M. Hence the map is injective. The action on 
maps L; — Lz is self-explanatory. 

For (3), note an extension is Galois iff it is the (minimal) splitting field of a separable 
polynomial f. Take g to be the minimal polynomial of a primitive element a; note @ generates 
l/k. Note by Proposition g is separable. If L/K is Galois, then g splits over L so 
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g splits over /. Combining the previous two statements, //k is Galois. Conversely, suppose 
I/k is Galois. Since g splits into nonrepeated linear factors, Hensel’s Lemma |20.5.2) lifts it 
to a factorization of g. Hence g splits over K into distinct linear factors, showing L/K is 
Galois. 


Suppose fk is a finite field. In this case, the separable extensions |/k are exactly the 
finite extensions. Moreover, we understand what these extensions are; there is one of each 
degree, and we can find the corresponding L/K explicitly. Furthermore, by surjectivity in 
(3), G(L/K) contains a unique element mapping to the Frobenius element in G(I/k); see 
Definition [24]24.1.1 


Lemma 21.2.5: nroot-unram Let (07 be a root of 
f{ XxX =—cH=0 
where a is a unit and p{n. Then K(a)/K is unramified. 


Proof. Let g(X) | f(X) be the minimal polynomial of a. Let L = K(qa) and I be its residue 
field. 

Note that f’(X) =nX"~! # 0 has no common factor with f(X) = X" — a, even when 
reduced modulo p, as p{n anda ¢ p. Hence f(X), and a fortiori g(X), has no repeated 
root in k. Any factorization of g(X) in k gives a factorization of g(X) in K by Hensel’s 
Lemma. Hence g remains irreducible in k[X]|. This shows [l : k] = [LZ : K]. By the degree 
equation, L/K must be unramified. 


Theorem 21.2.6: unram-ram Let L/K be an extension of complete fields with finite residue 
fields. Then there exists a field K C L, C LE such that L,/K is unramified and every 
unramified extension of K contained in L is contained in L,,. Moreover, 


1. L, is obtained by adjoining to K all roots of unity in L whose order is relatively prime 
to q := char(K). 


2. L/L, is totally ramified. 


L 
totally ramified 


Ly 


unramified 


ia 


We call L,, the maximal unramified extension of K contained in L. This is useful... 
Defining this when L/K is infinite 
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Proof. Let L,, be the compositum of all unramified extensions of K contained in L. Then L,, 
is unramified by Proposition [21.2.3] and it contains all unramified extensions of K contained 
in LD. 

For each n not a multiple of p, K(¢,)/K is unramified by Lemma|21.2.5] Letting q = |k|, 
the corresponding extension of residue fields is k(¢,)/k = Foragin)/Fy. We get all finite 
extensions //k in this way, thus all unramified extensions L’/K in this way. Taking the roots 
of unity inside L gives the result. 


3 Ramified extensions 


Definition 21.3.1: Let L/K be a ramified extension of local fields, with g := char(k) = p”. 
We say 


1. L/K is tamely ramified if p { [I : k]. 
2. L/K is wildly ramified if p | {I : k]. 


We seek analogues of Lemma /21.2.5]in for ramified extensions. 

For a prime p of a Dedekind domain A (not necessarily corresponding to a local field) 
let vp denote the corresponding valuation. (That is, if vp(a) is defined such that p’»“ is the 
highest power of p dividing (a).) Note the following two facts. 


1. If pB = B°, then 


2. Ifa; +---+a, =0, then the minimum value of v,(a;) is attained for two indices. 


Definition 21.3.2:  cisensteinar An Eisenstein extension relative to p is an extension K(a)/K 
where the minimal polynomial of a is of the form 


fle) = 2" tay 12" +--+ 
where vpa; > 0 and vpao = 1. 


Theorem 21.3.3: cisenstein-ramitication The prime ideal p totally ramifies in any Eisenstein ex- 
tension relative to p: 


pB= "Pp, P= (f(a), PB). 


Proof. Let 3B8°||p. Note e <n =[L: K]. We calculate the valuation of f(a) with respect to 
5, 


Up(a") = nova) 
up(a.a“) = e+ kord > e, 1l<k<n-1 


Up(ao) = e. 
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Since f(a) = a"+---+a9 = 0, the minimum valuation must be attained for two terms. The 
only way this is possible is if nordy(a) = e. Then ordy(a) = 1 and n = e, as needed. 


Theorem 21.3.4: Let K be complete with respect to a nonarchimedean valuation. The 
totally ramified extensions of K are exactly those of the form AK (a) where a is the root of 
an Eisenstein polynomial. 


Proof. The forward direction follows directly from Theorem [21.3.3 

Conversely, let L/K be a totally ramified extension. Take a to be a generator of the 
maximal ideal $8 of @,. Note ord(a) = 4 since (a)” = p. Note that for any a,_1,...,@ 9, we 
have 


k; 
ord(a,a*) = ord(ax) + = (mod 1), 


since ord(a,) is an integer. Thus, the nonzero terms a,a*, 0 < k <n, have different orders. 
Thus by Proposition [20.2.6] Ana" + +-++++a 9 #0 unless all coefficients are 0. This shows 
that a must have degree n; suppose a” +a,_;0"'+-+-+a9 = 0. Again by Proposition [20.2.6 
the minimum order is attained for two terms. We have 


ord,(a”) = nord,(a@) = 1 


ordy(a,a*) = k ord(ay) + . 0<k<n-1. 


The only way this can happen is if a” and ord(ao) are the nonzero terms with least order. 
This gives ord(a9) = 1, and ord(a,) > 0 for 1 < k <n, i.e. the polynomial is Eisenstein. 


Theorem 21.3.5: Suppose L/K is a totally and tamely ramified extension of degree n. 
Then L = K(a) for some a a root of 


X"—7=0 
for 7 € p. 


Proof. Take 6 € $8. Since L/K is totally ramified, ord,(6”) = 1. Hence 8” = um for some 
u € B%, and £ is a zero of 
g(X) = X” — un. 


Unfortunately, uw may not be in A. However, we show that this polynomial is close enough 
to 
f(X) := X"-u'r 


for some u’ € A and proceed as in Theorem to show that the roots of these two 
polynomials generate the same extension. 

Since L/K is totally ramified, 1 = k, ic. A/pA = B/B. Thus there exists u’ = u 
(mod $8) with u’ € A. This means |u’ —u| < 1. Letting aj,..., a, be the roots of f(X) = 0, 


|B — aa|-+- |B — onl = |f(8)] = um — ula| < |r| = Jar] -+- lon 
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so |3 — a,| < |Ja,;| for some j; without loss of generality j = 1. 
Since L/K is tamely ramified, p{n and f'(a,) = naj" has valuation |a,|"~!. Hence 


ay|"-* = | f’(a1)| = |(a1 — @2) «++ (a1 — On). (21.2) 


Note |ja;| = Ju’t|» = lay]; hence Ja, — a;| < Jai|. By (21.2), equality must hold. Hence 
|G — ai| < |a, — a,| for all 7 A 1, and by Krasner’s Lemma |20.8.1| K(ai) C K(8). Since 
both extensions are totally ramified of degree n, L = K(8) = K(aj). 


derivative-valuation 


The analogues of Proposition |21.2.3] carry over exactly. 


Proposition 2 1 ° 3.6: ramified-props 


1. Suppose that kK C L C M are finite extensions. If M/Z and L/K are tamely ramified, 
then M/K is tamely ramified. 


2. Suppose that kK C L, M are finite extensions. If L/K is tamely ramified, then LM /M 
is tamely ramified. 


3. Suppose that kK C L, M are finite extensions. If L/K and M/K are tamely ramified, 
then LM/K is tamely ramified. 


Theorem 21.3.7: Let K be a field with characteristic 0 and finite residue field, and let p 
be a prime in @x. Given n, there are only finitely many extensions of Ky with degree at 
most 7. 


Proof. First we show that there are finitely many totally ramified extensions of degree n. 
Every such extension is realized by adjoining a root of an Eisenstein polynomial of degree 
n. By taking the coefficients, an Eisenstein polynomial can be identified with a point of 


poly-compact J) xr XK p x A”. (21.3) 
a 


n-1 


The topology given by ||-|| is exactly the product topology here; this is compact by Propo- 
sition Now for each polynomial f, by Theorem [20.8.2] there exists an open set U; 
such that any g € Uy has roots generating the same extensions as those of f. Since 
is compact, a finite number of Uy cover f. The roots corresponding to those f generate all 
the totally ramified extensions of degree n. 

By Theorem Any finite extension L of degree n is an totally ramified extension 
of degree ” of an unramified extension L,, of degree m for some m. By the remark after 
Theorem [21.2.4] there is exactly one unramified extension of degree m; for each L,, by the 


above there are a finite number of possibilities for L. 
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4 Witt vectors* 


We know from Proposition 0.4.5]that every element of A can be written as en>N Ant” where 
the a, come from a fixed set of representatives for A/m. Although this allows us to write down 
any element, unless the set of representatives is closed under addition and multiplication (i.e. 
form a copy of k in A), we cannot simply add and multiply the coefficients. Instead, we find 
that addition and multiplication are governed by Witt vectors. We will actually develop this 
theory in a more general context. 


Definition 21.4.1: Let p be a prime number. A ring R is a strict p-ring if R is complete 
and Hausdorff with respect to the p-adic topology, p is not a zero-divisor in R, and the 
residue ring R/(p) is perfect. (A ring of characteristic p is perfect if the map x +> 2? is 
bijective. ) 


We will primarily be interested in the case where FR is an unramified extension of Zp. 


Theorem 21.4.2: Let K be a perfect ring of characteristic p. 
1. There is a strict p-ring R with residue ring A, unique up to canonical isomorphism. 


2. There is a unique system of representatives tT : K — R, called the Teichmiller 
representatives, such that 
T(2y) = T(2)r(y) 
for allz,y € K. 


The main example of interest to us is the following. 


Example 21.4.3: Fix f; then there is a unique unramified extension of Z, with residue 
field F,, ¢ = p!, namely Z,[¢,+_1]. The Teichmuller representatives are the (q — 1)th roots 
of unity g—-1. They are multiplicative, but not additive. The following construction will tell 
us how to add them. 


Lemma 21.4.4: witt-operations Given X = (Xo, X1,...), define 

W,(X) = XB + pXPP +--+ p"X,, n>0. 
Then there exist polynomials 

Sos 01, e+ 9h Poy ees = AMG Ay 6245 YO, YI5 020! 


such that 


where X = (Xo, X1,- : il y= (You tes : i = (So, 51, -- aN and P = (Fag Piss : ule 
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The motivation for defining these polynomials is that they tell us how to add in strict 
p-rings using the base-p represenation with Teichmiiller representatives as coefficients. 


Theorem 21.4.5: Let R be a strict p-ring, k its residue ring, and tT : k + R be the system 
of Teichmiiller representatives. Then 


cee i cas i sas —n —(n-1) —n —(n-1) 7 
> (en) = S- T(Yn)P = baton ca: iy gee iy Yi gear ie »Yn)P * 
n=0 n=0 n=0 


Proof of Lemma We will abbreviate 
W(X) = (Wo(X), Wi(X), .--) 
RDG Misch Vo. Vad 


All comparisons between X, Y will be done componentwise, and we define X” = (Xj, X7’,...). 
We find the S,,,, Py inductively, with the additional condition that S,,, P, are polynomials 
in Xo,...,Xm,Yo,---; Ym. To begin, note Wo(X) = Xo so we set 


So( X,Y) = Xo + Yo 

P(X, Y) = XoY. 
Lemma 21.4.6: If Fi,,Gm € Rand F,, = G, (mod p) for every m, then 

W,,(F) =W,(G) (mod p*"). 
Proof. First note that for any f,g € R such that f = g (mod p), 
fr =gP (mod p*"). 

The proof is by induction, with the induction step following by the binomial theorem: if 
fr" = g? + ph then 


f? = (gq? + pih)? =g? + (Pen hg” @-Y) + pitt 
a 
pi 
for some k € R. 7 7 
This claim gives f? ” gj; ” (mod p"~J+) and hence 


n— 7 


pf?” = pig?” (mod p™"). 


Summing these up give the result. 


Directly from the definitions, we have 


W,,(X) = Wr_-1(X”) + pv" Xp. 
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Hence the equations 


are equivalent to 


Wr—1(S”) + p" Sp = Wr_1(X”) + p"Xn + Wr_-1(Y”) + p"Yn 
witt-add = W,,-1(S(X?, ¥®)) + po Xa + ¥,) (21.4) 
Wr—1(P?) +p Pa = (Wa-1(X?) + pv? Xn) (Wr-1(Y”) + v°Yn) 
avenues = Wy (P(X?, ¥?)) + p?(X,WralY?) + ¥,Wrai(X?) +p" X,Y,) 
(21.5) 


where (21.4) and (21.5) follow from the hypothesis for n — 1. Solving for S;, and P,, these 
are equivalent to 


W,,-1(S(X”, Y?)) — W,_1(S?) 


S,=X,+Y,+— - 
Dp 
W,,-1(P(X?, Y?)) — W,_1(P? 

P, = XpjWy—1(Y”) + YpWy_1(X") + p”XnYn 4 (P| = i), 
However, since taking pth powers is a homomorphism modulo p, for any f € R we have 
f(X,Y)? = f(X?, Y”) (mod p). Applying this to f = S;,P;, we see the conditions of the 
lemma are satisfied, so the numerators are divisible by p”, and we can successfully define S,, 
and P,,. 


Theorem 21.4.7: Let A be a commutative ring. For 
a= (Gg, @iys++), b= (bo, b1,---), a, 0; € Ay, 


the operations 


a +b= S(a,>), ae 


turn the set AN° into a commutative ring W(A). 


SP (6,0); 


This is called the ring of Witt vectors over A. 


Proof. We first prove that associativity, commutativity, and distributivity hold as polynomial 
identities in the aj, b;. The result then follows by considering the substitution homomorphism 
Z|ao,...;bo,---] > A. 


Lemma 21.4.8: The function W : RN > RN, where R := Z[ao,...;bo,...], is injective. 


Proof. Suppose X = (Xo,Xi,...) and W(X) = (Yo, Y1,...). We show the X; are determined 
by induction. We have Xp) = Wo(X) = Yo. For the induction step, note 


Yn = Wa(X) = XP" + pXP" +--+ +p" Xp; 
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since Xo,...,Xn—1, Yn are determined and multiplication by p” is injective in R, X,, is 
determined. 


Lemma |21.4.4| gives 


Hence W : W(A) > RN is a map that preserves addition and multiplication; moreover, it is 
injective. Its image is a subalgebra of R, since it contains 0 and 1: 


w 
Hence + and “ turn W (A) into a commutative algebra with unit (we are basically “pulling 
back” the algebra structure from RN to W(A) using W). 


4.1 Frobenius and Transfer maps 
5 Extending valuations on global fields 


Theorem 21.5.1: ex-varix Let |-| be a valuation on K and let A be the completion of K 
with respect to |- |. Let ZL = K(a) be a finite separable extension of K, and let f be the 
minimal polynomial of a. 

The completions of L with respect to the extensions |- |/ of |- | are exactly K[X]/(h) as 
h ranges over irreducible factors of f in K. 


Proof. Suppose we are given an extension | - |’. Let L be the completion of L with respect 
to |- |’. 


L = Ka] ——>- L = K [a] 


K ——_>k 


Note A[a] contains a and is complete (as it is a finite-dimensional vector space over a 
complete field), so L = K[a]. Then considering the extension L/K, a is the root of one of 
the irreducible factors of f in K [X]. 

Conversely, given an irreducible factor g of h in K[X], consider K[a’] = K[X]/(g). 


L = K(a)\—>L = K(a’) 


K ————+ Kk 
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The valuation on A extends uniquely to K(a’) by Theorem|20.6.1] Then let K(a) @ K(a’) 
be the map sending a to a’. (This makes sense as the minimal polynomials of a, a’ over K 


A 


are both f.) By the same reason as before, L = K(a), as desired. 


Theorem 21.5.2: compiction-tensor Let AX be the completion of K with respect to a archimedean 
or discrete nonarchimedean valuation |- |. Let L/K be a finite separable extension. There 
are finitely many extensions of |-| to L; denoting them by |-|; and the respective completions 
of L be L;, we have the natural isomorphism 


Proof. By the primitive element theorem, we can write L = Kk (a). Let f be the minimal 
polynomial of a. Let f factor into irreducibles in K[X] as 


f=fi--- tn 


Then 


K @x L&® K x K{e]/(f) = K[x\/(f) = LL tel/() = 


- Thm 15.1] ” 
~~ 
[LL 
i=1 


Note the map in the theorem sends 
a@®b++ (a,b,...,anb), 


where a; is the embedding of a into L;. We now have a way to calculate norms and traces 
in terms of completed fields. 


Corollary 21.5.3: compicte-ntr Keep the same notation as above. Then 
1. Nmz/«(a@) — Tj Nm, ,/%(@). 
2. trp/K(Q) = Tia try, ;R(@)- 


Proof. Using Proposition |14]14.2.3(1) and Theorem |21.5.1| we see 


a’ root of f a’ root of fn 


wnat) TE = (oo) = 2m al 
tuxo)= ES o'=( Oana Do!) =Sirwelo) 


a’ root of f a’ root of fi a’ root of fn 
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6 Product formula 


Lemma 21.6.1: compare-normea-vai Let L/K be a finite extension of number fields, with normal- 
ized nonarchimedean valuations w | v, as in Example|20.2.3} Let | - |/, be w normalized so it 


extends v. Then 
| . lad = | : ae 


Proof. Easy, see Milne pg. 132. O 


Theorem 21.6.2 (Product formula): proauct-formuia For any nonzero a € K, 
II lel =1. 
vEVK 

Proof. 


Step 1: We first show the result for kK = Q. Given n € Q, factor it as n = +]]?<, pj’ where 


p; are all the prime numbers; note only a finite number of the a; are nonzero. Then 


jal = (II al ale. = (II v) (Ii) = 


Step 2: We pass to field extensions of Q using the following lemma. 


Lemma 21.6.3 (Extension formula): extension-formuia Let A’ C L be number fields and let v be 
a place of kK. Then 


Il lal, = [Nmz ic al. 


wiv 


Proof. For a place on L let |- |/,, be the valuation normalized so that it extends v. We have 


[Nm jie al, = Il Nmz,,,/K,(Q) - 
_ / 
= |] |Nmz,,/«,(@)], 
ad. 
= ][ /Nmz,,/x, (a) a“ by Lemma 21.6.1 
=][ lel, by Theorem [20]20.6[1 


Step 3: Since every place on K restricts to a unique place on Q, 


ID lel. = 1 Tel. = ID |Nmeyx (a). “2° 1, 


weve veV wlv vEeV 


where we apply step 1 to Nmz/x (a). 


The product formula will be useful when defining a measure of size independent of scaling 


(see Chapter [39). 
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7 Problems 


1. Let K be a complete nonarchimedean field whose residue field has characteristic p. 
Prove that the maximal tamely ramified (separable) extension of K is 


Ke= kK, ({rm :p{m}) ; 
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Chapter 22 


Ramification 


ramification We seek to generalize the definition of discriminant over Dedekind domains A which 
are not PID’s. To do this we will first define the different, which measure how much we can 
enlarge B so that the image of the trace map is still in A, then define the discriminant as 
the discrepancy between B and the enlarged B, using y4. We will find that the different is 
the (ideal) norm of the discriminant. 

We will see that our definition coincides with our previous definition when A is a PID. 
Fortunately, we don’t have to prove everything from scratch again: by localization we can 
always reduce to the DVR/PID case. 

The main use of the discriminant is to measure ramification: The primes dividing the 
discriminant are those that ramify. On a deeper level, the exponents measure the degree of 
ramification. 


1 Lattices and y 


Definition 22.1.1: Let A be a Dedekind domain, kK = Frac(A), and V a finite dimensional 
K-vector space. An A-submodule X C V is a lattice if it is finitely generated A-module 
and spang(X) =V. 


The most basic example of a latice is a fractional ideal of K. 
We would like to measure the discrepancy between two lattices—like the norm, but 
measured by an zdeal instead. To do this, we first need some facts from commutative algebra. 


1.1 Filtrations of modules 


Definition 22.1.2: A module is simple if it is nonzero and has no nonzero proper submod- 
ule. A composition series of length m is a chain of submodules 


M=M)>M,5D-:--D My = 0 


where M;_;/M; is simple for each i. M has finite length if it has a finite composition series. 
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Proposition 22.1.3: The simple modules are exactly those in the form R/m where m is a 
maximal ideal of R. If M is simple, M = R/m where m= Ann(M/). 


The main theorem on filtrations is the following. 
Theorem 22.1.4 (Jordan-Hélder): Suppose M has a composition series. 
1. (Existence) Any chain of submodules of M can be refined to a composition series. 


2. (Uniqueness) Any composition series of M has the same length; moreover the number 
of times R/m appears as a quotient M;_,/M; in the filtration is invariant. 


We will be applying this when R is a Dedekind domain, so the maximal ideals are simply 
the nonzero prime ideals. 
We also need the following. 


Proposition 22.1.5: If M/M’' and M' have finite length, then so does M. 


1.2 The function y, 
Definition 22.1.6: Let A be a Dedekind domain. Define 


xa: {A-module of finite length} — {ideals of A} 
as follows: Given M of finite length, with composition series 
M=Mo 2M, 5D-:::-D Mm =0 
and A/p; = M;_1/M;j, define 


Example 22.1.7: The primes appearing in the filtration of an ideal a C A are just the 
primes dividing a with multiplicity, so 


Proposition 22.1.8: chiccxact If M’ and M” have finite length and 0 — M’ > M > M" > 0 
exact sequence of A-modules, then 


xa(M) = xa(M")x4(M"). 


Definition 22.1.9: Let A be a Dedekind domain, K = Frac(A), and X,,X_2 C V be A- 
lattices. Choose X3 C X,M X2 any A-lattice and define 


vA(X1, X2) = xa(X1/X3)Xa(X2/X3) 


as fractional ideals of K. 
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Proof of well-definedness. We show this is independent of choice of X3. 
Observe vy 4(X1, X2)x4(X2, X1) = (1). Note this is independent of choice of X3. It suffices 
to show that 
x.a(X1/X3)xa(X2/X3)* = Xa(X1/X4)xa(X2/Xa) 


when X4 C X3. This follows by the exact sequence 
0-> X3/X4 => X1/X4 = X1/X3. 


and Proposition |22.1.8 


1.3 yx and localization 


It is easier to study x4 when A is local; in this case y4(X) is simply a power of the maximal 
ideal. To understand x4 (and hence the discriminant) for general A, we thus consider the 
localization of A at all primes. The following says that y 4 is well-behaved under localization. 


Proposition 22.1.10: chi-exponent Let A be a Dedekind domain and p C A be a nonzero prime. 
Then 
Up(XA(X1s X2)) = Upay (XA, ((X1)ps (X2)p)). 


Proof. Note Xp = Ap: X = Ay @,4 X is an A,-lattice of V. 
Localization is exact, so preserves quotients. Suppose M > N are adjacent terms in the 
filtration of A. If M/N = A/p then 


M,/Np = (M/N)p = (A/P)p = Ap/PAp 


while if M/N = A/q, q 4 p, then M,/N, = 0. Only the quotients with A/p remain; the 
result follows. 


Proposition 22.1.11: Let A be a Dedekind domain with fraction field AK, X an A-lattice 
in V, ando € Autx(V). Then 


x'4(X, aX) = (detc). 


Proof. It suffices to check both sides have the same p-valuation for every prime p of A; by 
Proposition |22.1.10|this is equivalent to 


XAp (Xp, OnXp) = (det On): 


Thus we only need to check the proposition for the case where A is a DVR, hence a PID. 
For all nonzero a € A, 


x(X, ao X) = a"x(X,oX) = det[a] - x(X, 0X); 


note we used x(uX,auX) = a” since X is free over A, and that the matrix of the trans- 
formation [a] is simply aJ. Thus by choosing a such that aoX C X we may assume 
aX) XxX. 
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By the structure theorem for modules, X/aX = A/a; x --- x A/a, for some a,, giving 


xA(X, 0X) = (a1--+ an) = (det o). 


1.4 Discriminant of bilinear forms 


In this section we will define the discriminant of a bilinear form T' on a lattice XY over K, the 
fraction field of a Dedekind domain A. When we specialize to the case that X is a extension 
of A and T = tr, then we get a generalization of our original definition [14]14.3.1] in the case 
where X is not necessarily free over A. 


Definition 22.1.12: Keep the above assumptions. Let V be a finite-dimensional K-vector 
space and T': (V,V) > K be a nondegenerate K-bilinear form. Thinking of T as a map 
V ®k V > K, we get a map 


MT: N°V @KNV OK 
defined by 


wedge-T-detN”T (U4 A+ ++AUn, Wi A+ +*AWn) = S- (1 Tay Away) = 2 (UnAWagp)« (221) 
TES 


Note A"T’ ® A"T is a 1-dimensional vector space over K’, with lattice A°X @x A" X. Define 
the discriminant of T’ on X to be 


OxT = xA(AT, AVX ®&) I 


The main reason for defining the discriminant as above is because the “A” construction 
is natural and makes it easy to prove a few basic properties. 


Proposition 22.1.13: cniact If X is free over A with basis (e1,...,€,), then 
dxr = (det(T(e;, e;))). 


Proof. Note that X ®x X is generated by A”T(e1 A+++ A én, €1 A+++ Aen). By (22.1), this is 
exactly (det(T'(e;, e;))). 


We now give an alternative characterization of the discriminant, in terms of the dual 
lattice. 


Definition 22.1.14: Define the dual of X with respect to T’ by 
Xpi=f{yeV:T(a2,y) € A for allz eX}. 
This is an A-lattice of V. 
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We first need the following. 


Proposition 22.1.15: auatbasis If €],...,€, is a basis for X over A, and ej,...,e* is a dual 
basis, i.e. T(e:,e;) = di; for each j, then ej,...,e7, is a basis for X* over A. 


Proof. Note y € X* iff T(e;,y) € A for each j. Writing y = 77_, aje%, we find T(e;, y) = aj, 
so y € X* iffa; € A for each j, ie. y € span,(ej,...,e%). 


mn 


Proposition 22.1.16: We have 
XA(Xp, X) = Ox,r. 


Proof. We use the fact that a fractional ideal is determined by its localizations at all primes 
(this follows since the exponent of p in a is the same as that of pA, in aAy, Proposi- 


tion |15}15.2.5). 


By using Proposition |22.1.10} we may localize at nonzero p C A. Hence it suffices to 
prove may assume A is DVR, i.e. free over A. 
Write 


En e€ 


where B = (b;,;) is an x n matrix. Then by Proposition |22.1.13} 


Ox = (det(T (ei, e;))) = (det(bi3)) = x(Xp, BX7) = x(Xr, X), 


as needed. 


2 Discriminant and different 


For the AKLB setup with L/K finite separable, consider the nondegenerate K-bilinear map 


tr:Lbx Lok 
(x,y) + trysK (ay). 


Definition 22.2.1: Define the codifferent 
B* := Bi ={y € L: tr(zy) € A for all x € B} 
and the different and discriminant by 
Deja = OL/K = Bye 


OB/A = OL/K ‘= OB er 
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(These are fractional ideals of K.) 
Observe that B C B* (in light of trzs«(B) C A) so B D Opa. 
The following gives the precise relationship between the discriminant and different. 


Proposition 22.2.2: Nmzp/K(9z/a) = 0B/a. 


Proof. We have 
Opa = XA(B*, B) = xa(B*/B) 
and 
Deja = (B*)* = xp(B"/B). 


The result thus follows from commutativity of the following diagram. 


{finite length B-module}*———_-> Ig 


a Neue 
I 


K- 


We have commutativity since if B/SB is a quotient of adjacent terms in the B-filtration of 
M, then when we refine it to a A-filtration, since B/38 = (A/p)//*) as vector spaces, we 


get f(38/p) copies of A/p. 
Note: Grothendieck group. 


2.1 Basic properties 


First, a slightly cleaner characterization of the codifferent. 
Lemma 22.2.3: coaitiem a € Ix and b € J;. Then 
trr/K(6) Ca = 6C aD aia: 


Proof. We check tr(a~'b) C A iff a~!b C Dr iK: 
The reverse direction is clear. For the forward direction, note that if 7 € a-'b and y € B, 
then zy € a-'b and hence tr(xy) € A. This shows x € Deja: 


Proposition 22.2.4: 


1. (Transitivity) Let M/Z be a finite separable extension, with C' the integral closure of 
Ain M. Then 


Doja = Do/sDs/a. 
2. (Localization) For S C A a multiplicative subset, 


S Daya a Ds-1B/s-1A- 


226 


Number Theory, 822.3 


3. (Completion) : 
Deja By = Da, / Ay: 


Proof. 1. We have 
€ € DojpD pa 
—— D Bae Cc Doe 


<> try/1(De/ae) CB Lemma |22.2.3} with M/L 
==> Dpatrmyz(e) CB 
———s truyx(e) € D/A 


<> try (truyr(e)) CA Lemma |22.2.3] with L/K 
> € EDGE). 


2. Omit. 


3. Localize at p. May assume A is a DVR. (B may not be a DVR.) Consider 


ITsp\p By ITsp1p Ly 


|: |x 


B®, A,x— L @x K, 


|prunouk 


A, kK, 
The top-to-bottom map on the right is > ten Kp: Then 


-1 A w -1 
OBIA QA Ay => D a4Ay/Ay 


~N -1 


~ a te Bry / Ap 
=[|o. : 
By/A 
ip p /Ap 


=] 54 Sp By 
Plp 


3 Discriminant and ramification 


Recall ordg (9g, = Ordp(Oe/4). Our goal is to show that epj = 1 and K(B)/x(p) 
separable (i.e. $B is unramified over K, iff Bf 0p/a. 
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In the CDVR case, 


we have B/pB = B/°. 


Lemma 22.3.1: 7 
trz/K(b) mod p = etr/,(0). 


Proof. For all b € B, 
0 = B/Pe CP U'/P C++ C P/P* C B/P. 


Each adjacent quotient is 1 dimensional over / and hence f-dimensional over k. Choose a 
basis {w;}"_, (n = ef) for B/3B° as k-vector space, such that 


span,({wi} fA (7) 741) = PI /P°. 


(The last jf vectors span B°4/P°.) Lift {w;} to w; € B such that w; mod P* = w;. The 
w; are a basis of B over A. Now 


trp/«(b) = tre (ms) 
bw = (bi,5) (Wj) jar 
bw; = (0:3) (W;). 
We have i 
trpjK(b) = So ii mod p. 


i=l 
Now (6; mod P) prri<ij<p(k+1) represents the linear map (multiplication by b) 
Bp Bp 
Pr+1 /Pe Ser l (ye : 


The trace as a k-linear map is tr;/,(b). There are e such f x f blocks. 


Corollary 22.3.2: 
ordy (0/4) >e-l. 


Proof. It suffices to show 
ord»(Opya) 2 (e — If. 
This is since Dg/4 = P° implies 0p/4 = Nmz/K(P°) = P". 
Now 
OB/a = (det trz/K(wiw;)) 
same as in the previous proof. Now w; € Pif f+1<i<n=ef, therefore w; € P/P°. 
For all j, trz/K(wiw;) € BOK = p, giving the result. 
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Now consider the general case. 
Theorem 22.3.3: Suppose A, B are Dedekind. Then 
1. ordy(D pa) > exp/p — 1. 
2. $B is unramified over K iff PB 1 DrK. 
Serre does this by Eisenstein polys. 
Proof. 1. ordy(Dzga) = ord (D p,.4,)- and ex /) = g/g. Use the CDVR case. 


2. For “=”, note ordy(9r/x) implies 0 > eg/yp — 1 i.e. ep = 1. 
For “=>”, it suffices to prove p { 0g/4. Reduce to the CDVR case. Now 


det(try/«(wyw;)) mod p = eg ptri/,(Wiw;) A 0 


if 1/k is separable (Neukirch [.2). 


3.1 Types of ramification 


Definition 22.3.4: ‘PB is unramified if ey/, and //k separable. For PB ramified, 
1. $8 is tamely ramified if either char k = 0 or chark { egy. 
2. $B is wildly ramified otherwise. 
Theorem 22.3.5: ‘8 is tamely ramified over K iff 
ordy (Ox ja) = exp/p — 1. 
Proof. Reduce to the CDVR case. 


Step 1: We show that 8 is tamely ramified iff trz;~(B) = A. Observe that trz/«(B) is an 
ideal of A, so the latter is equivalent to trz;~«(B) (mod p) 4 0. But we know 


trzj«(b) mod p = esxptri/x (0), 
and trj/,(b) 4 0 (not identically 0). Hence ey /) 4 0 (mod p) iff trz/x.(b) #0 (mod p). 
Step 2: trysx(B) = A => ordy(Dzr/K) = ep/p — 1. 
We’ve seen 


trp/K(6) Ca = 6C aD aa: 
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Plug in 6 = B to get, as ideals of B, 


A’ := tr(B) Ca = BCadzi, 
— DrL/K Cc aB 


Write A’ = p*. We have p* | D iff p* | A’ for a € Z. (Power can be rational.) 
For a € Z, ord,(A’) > a iff ord,(D) > a. 
Thus we get 
ord,(A’) < ord,(@r/xK) < ord,(A’) + 1. 
e—————" 
ords; (9) 
“BR /p 
Thus tryp/x(B) = A iff a = 0 iff ordy(D) = e — 1. 
Thus 8 is tamely ramified iff u(D) = e — 1. 


3.2 Computation of different 


Proposition 22.3.6: monogenous When A and B are CDVR’s, B is generated by one element 
over A as an A-algebra: 


B = A[s. 


(We say that B is monogenous over A.) 
Let ZL := Frac(A), K := Frac(B). When L/K is totally ramified, then we can choose 6 
to be any uniformizer 7. 


Proof. Any element of B can be written as 0,59 @%7% Where a, are fixed representatives of 
|= B/(mz,). But we can choose the a, to be representatives of k = A/(7x), since k = l. 


Theorem 22.3.7: (Residue field extension separable.) 03/4 = (fg(8)) where fg(x) € A[z] 
is the minimal polynomial of @ over Kk. 


Proof. 
Lemma 22.3.8: 


Gs )=49 ae 
*L/K \ F773) ate p97 1, 


Proof. The eigenvalues of multiplication by @ are just the roots $1,..., Bn of the characteristic 
polynomial. Note that if A is a linear operator with eigenvalues \; and P is a polynomial 
then P(A) has eigenvalues P(A;). Hence 


Number Theory, $22.4 


Let D(a1,...,2n) = Uj<;(ai — 2;). Noting f’(8;) = I]jz(6i — G;), the above equals 


1 5 isc te) 
D(x, ees Tox) f= 1 TLjve(@i = ,) 
————<$<—- 
P(21,...,0n) 
evaluated at (%1,-..,%n) = (f1,.--,8n). Consider P. Note P is zero whenever x; = x, for 


some i # j (All except two terms are 0; those two cancel.). So x; — x; | P, and D | P. 
However, P has degree less than =U" when k <n —1, so must be 0. If k =n —1 then we 
know P is a constant multiple of D, look at the coefficient of any term to see that in fact 


Pe= DD; 


It suffices to prove 
(f5(8)~') = BY := {DE L: trzx (bv) € A for all b’ € BI. 
The condition inside is equivalent to 
tr(b6)e A, O<j<n-1. 


(because B = @ A{".) But by the lemma, 


Sak die 
tr (= ua) =i Hagetee (2) 


“Triangular.” Therefore 


B= @4 a5 (aa): 


Good exercise: Compute DQ, (¢,n)/Q,- This is tamely ramified only at n = 1. Totally 
ramified tower. The first step is (Z/p)*, tame, everything else is p, wild. 


(Note G(Q(Gp)/Q) = (Z/p"Z)* because Dp = G(Q(Gpr)/Q). 
4 Ramification groups 
Local, CDVR setup. 
Definition 22.4.1: Let 2 > —1. The ith ramification group is 


G;={0 €G:bE Gr, vz(o(b) — b) > 14+ 1} 
={o0€G: v,(o(8) — 8) >i+1}. 
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Observe G_,; = G, that G; > G; for i < 7 and f);s_, G; = {1}. Also note for all 7, G; is 
a normal subgroup of G because 


G,; = ker(G > Aut(@,/(ni*")). 


In particular, G;/G., is a group. Furthermore G; is defined even when 7 is not an integer; 
we have G; = Gra. 
We will study {G;};s-1. We want 


1. A formula for vz(Dz/xK). If at most tame, equals e — 1, else greater. 


2. Look at quotients G;/Gj41. Abelian, cyclic, p-group, prime-to-p? 


4.1 Dy /x and ig 
Definition 22.4.2: Let ¢ € G(L/K). Define ig : G — No U {oo} by 
ig(o) = min {uz(o(8) — 8): B € B}. 
Note that if B = A[s], then 
ig(o) = uz(o(B) — 8). 
Observe 
e ig(o) =o iffo = 1. 


eG, = {0 €G:ig(o) >i+1}, so a € G; iff ig(o) > 1+ 1, so doesn’t depend on choice 
of generator. 


Note 
ig(ror') =i¢g(a), 0,7 EG. 
Because G; <1 G. Note 
ig(oT) > min(ig(c),ig(T)). 
Because 
ig(oT) = vzt(oTB — B) 
> min(vz(or(8) — 7(8)), vr(7(B) — 8) 
= min(ig(c),ia(T)). 
since O, = Ox|8] = Gx[r 6]. 
Proposition 22.4.3: 
W(Dr/K) = Dd tale) = YO(IGi| — 1). 


oAl i>0 
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Proof. Let f(x) be the minimal polynomial for 6. Letting n = [L : Ky], 


n 


f(o) = TI(X - 8). 


Now 


Take uz for (1). 


For (2), consider multiset 


LI(Gi\{1})- 


i>0 


finite. Note o € G appears in Go, Gi,..., Gig(c)-1, there’s ig(a). Compute the size of the 
multiset in two different ways 


d(IGl — 1) = > te(¢). 


i>0 o#l 


Remark: vp(Dr/xK) =e —1 iff Gy; = {1} (because |Go| = e, iff L/K is at most tame. 
Let’s understand D,/«, ig under sub and quotient group. Consider L/ LK. 
First, sub. 


Proposition 22 . 4.4: ram-grp-sub 


in(o) =ig(c) for allo € H 
Hy =H 1G 


Proof. Same generator works for larger ring. O, = @x|6] => Or = Gx |B]. Then true by 
def. 


Corollary 22.4.5: 
vi(DrjK) = >> ig(o). 


o#1,c€¢H ~~ 
tH(c) 


For quotient. 
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Proposition 22.4.6: isis For HIG,G4#1¢«G/H, 
1 


ig/H() = 
€L/K' o€G,o mod H=a 


Corollary 22.4.7: 
1 
UK/K(DRyK) = 5 S- ic(o). 
€L/K o¢H,ocG 


Because by prev. 30344 1a/H(@) equals RHS by prop. 
Proof. Choose a € @x and 6 € Gz such that Ox = Ox[a'] and GO; = Oxf]. Then 


er/Kig/H(O) = er/KUK (Ta! — a’) 
= ui (aa’ — a’). 


qH —~— 
vi(oTB—B) 


=o; (11 (o7(B) — 3)) = before. 
TEH 
fixing o. 
It suffices to prove (ca! — a’) = TL en (o7(8) — 8). Call LHS, RHS a, b. 


1. a| 6: Consider 
9(X) = Tf (X — 7(8)) € Orcr[a]. 


TEH 


og(X) = [] (X —o7(8)). 


TECH 


Observe ga’ — a’ divides coefficients of og(X) — g(X). Because for all a € Ox, 
a=aot+ aja’, c® = a9 + aoa’ +--+. Note aa’ — a’ | ca" — a". Note g(8) = 0. Take 
x = 6 to get 
oa! — a! | og(B) — 9(8).- 
“sn 
0 
2. 6 | a. SWITCH f and g below. Cook up a minimal polynomial to show divisibility. 
a’ = Ox[P| = Gr. Write 


n-1 
a! = )7 af" =: 9(8). 
i=0 
a; € Ox. g(X) € Gx[X]. Consider g(X) — a’ € Ox [|X]. By construction has § as a 


root. 
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Hence plugging in x = £6, 


F(X) |g(X)—a 
of(X) | o(f(X)) — a(a’) 
of(8) | f(B) — o(a’) 


giving +6 | +a. 


End of unedited stuff. 


4.2 Filtration of ramification groups 


sec:ram-filt We know from (??) that 
C4052 Cline = Ci, 


In particular, if k is finite then G_,/Gp is finite cyclic and if k = k then G_,/G is trivial. 
From now on assume 2 > 0). 
We aim to study the filtration 


iainiication Aimatian-eg a: > Go =) Gy > Ee Se (22.2) 


To do this, we first study the filtration 


vnitetlerstioaagls” > ue > Us > rae (22.3) 
where 
2 VOR: i=0 
— 14 riO, i>. 


The quotient groups in can be understood explicitly (Proposition (22.4.8). We will 
relate the two filtrations by Proposition [22.4.10}! From this we get several important corol- 
laries about the structure of the groups G,. Understanding conjugates and commutators of 
elements in the G, gives us several more important properties. 


Proposition 22.4.8: units-titration Let AK be a complete field with discrete valuation (for in- 
stance, a local field), & its residue field, and m the associated maximal ideal. Then we have 
isomorphisms 


Un (Uy? = k* CO oe Sak 
ury+u (mod m) l+an”"'+a (mod Mm). 


'This will be important in local class field theory, which says there is a canonical isomorphism 
K*/Nmz/x(L*) = G(L/K) if L/K is finite abelian. 
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Proof. For the first just note that 1+ m is the multiplicative unit of A/m. For the second, 
note (1+ an™)(1+ br”) =1+(a4+b)n"+---. 


To construct a map G;/Gj41 > Ui/U;"", we first need the following characterization of 


Ge 


Lemma 22.4.9: si-cuotient-crit Suppose L/K is a finite Galois extension of local fields, 7 is a 
uniformizer of L, and G = G(L/K). For i € No and a € Go, 


gi-quotient-crit-eqT € Gy <> “7) =1 (mod 7%). (22.4) 
Proof. The RHS is equivalent to 
gi-quotiont-criti (7) — 77 =O (mod a"), (22.5) 
We need to show this is equivalent to 
gi-quotient-crit2(8) — 8 =0 (mod ait") for all 8 € L. (22.6) 


It is clear that (22.6) implies (22.5). 


First suppose L/K is totally ramified. Then G; = @x{r] by Proposition [22.3.6] giving 
that implies (22.6). 

Now consider the general case. We know L/L/“/* is totally ramified (Theorem|15]15.7.2), 
so the theorem holds for L/L/+/*. Now, by Proposition 22.4.4] G,(L/L'/*) =GiN Inn = 
G;. Furthermore, since 7; is the same for L/K and L/L*“/*, the right hand-side of 
does not change whether we are talking about L/K or L/L'“/*. Hence the theorem for 
L/L'“/* implies the theorem for L/K. 


Proposition 22.4.10: ramiscation-to-unit-group There is a well-defined injective group homomor- 
phism 
4; : G;/Gis1 = ioe 
o(r) 
1 


Ot 


that is independent of the choice of uniformizer 7. 


Proof. Note that 
ram-unitlU E C1, Oo E G; => o(u) =U (mod m) => au) — uP. (22:7) 


First we show 6; is a group homomorphism Gi; —> U} /U;*!. We have 
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o( 2) . 
Since rn) € O, and tT € Gi, (22.7) gives = ) E ue. 
Lemma 22.4.9 gives that the kernel is exactly Giii, so 0; induces an injective map 
Gi /Gis1 —> Le ue, 
Now suppose 7’ is another uniformizer. Write 7’ = uz with u € Of. Theno € G; 


and (22.4.9) give 


Corollary 22.4.11: 1. Go/G; is finite cyclic. 
2. If char(/) = 0 then G = {1}; if char(l) = p £ 0, then for each i > 1, 
Gi/Giz1 = (Z/pZ)™ 
for some n,. 


Proof. 1. Proposition |22.4.10] and |22.4.8] give Go/G, © U,/U; & I*. But any finite 


subgroup of a finite field must be cyclic. 


2. For ¢char({) = 0, a has no finite nontrivial subgroup. For char(l) = p, we have 
G;/Giz1 @ Ui /Uit! = 1+. Just note I+ is a an abelian p-group. 


Corollary 22.4.12: cor:G(local)=solvable Go a Ink is solvable. If G(I/k) = G_1/Go is solvable 
(in particular, if k is finite) then G is solvable. 


Proof. The series 
GIGS 


is a solvable series for G. 


4.3 First ramification group 


Recall that we defined Go = Iy/x so that we can split L/K into two parts: L/Lt1*« is 
totally ramified while L/“/*/L is unramified. We can further split the extension L/L/*/« 
into a wildly ramified and tamely ramified part. 
Definition 22.4.13: Define the wild inertia group and tame inertia group to be 

Gi =Ijx 


Go/Gi = Ipik: 
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Theorem 22.4.14: The extension L/ LIti® ig wildly ramified with Galois group G = Te 


and the extension L!/* / Ltik ig tamely ramified with Galois group G,/Go. 


wild ramification : : I Te 
wild 


DLLs 
tame ramification : : I Lk 
[1L/K 
unramified : : G/IpjK=G(t/k) 


nae 


Moreover, G, is the unique p-Sylow subgroup of Go, and 
Go = Gy x Go/Gi. 


Proof. Note Go/Gi — k* while G;/Gj41 < kt for 7 > 1; we have p { |k*| while |k| is a 
power of p; and |Gi| = Th<j<oo |Gj/Gjri|. Hence Gi is a p-SSG of Go; it is unique since 
it is normal and all p-SSGs are conjugate. Since the indices of the field extensions are the 
orders of the Galois groups, the result on tame and wild ramification follow. 

Now we prove the semidirect product. This follows directly from the Schur-Zassenhaus 
Lemma: If H is a normal Hall subgroup of a finite group G, then H has a complement, and 
hence G = H x G/H. (A Hall subgroup H C G is a group such that gcd(|H|,[G: H]) = 1.) 

The following is an alternate proof. We show the exact sequence 


1 Gi Go >Go/Gi — > 1 
Tie Tay iia 


splits by showing there exists a right inverse Go/G — Go of the projection Gp > G}.? Since 
Go/G; is cyclic of order r := |I*|, it suffices to find a lift 7 € Go of the generator 7 € Go/G, 
with order r. Write |Go| = p*r. Let 


e(r)t 
aaa 


where ¢ is such that y(r)t > s. Note r { p implies p*”* = 1 (mod r). Since a” € G4, this 
implies o is still a lift of ¢. Moreover y(r)t > s gives that its order is r, so it is the desired 
litt. 


Proposition 22.4.15: conjugate-rg For 1 > 1, Oo — Go, T — Gi/Gis, 
0;(aTa~') = O9(c)'0;(T). 


?The image of Go/G is a complement Q of G, in Go; the elements of Q act on G by conjugation—this 
is what the semidirect product means. 
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(Here 69(c)' is thought of as in U,/U} & 1%, and 0,(r) € Ut /Ui* SI.) 


Proof. It is slightly more convenient to work additively rather than multiplicatively, so we 
consider 


8; : Gi/Gizr 4 UL/UL = (n')/(n"™) 


o(7) f o(n) 0 
eae re ma 
T {ol ean 


where 7 is any uniformizer. 
Define 


and let a € Of be such that 


Note that 
O.(7) = 


Now we calculate, modulo (7)'*, that 


Gi(ero ') = ————"- = 1 


es since g(a) =a_ (mod 7") 


= (22) ae 


= 06(7)'64(7). 


Proposition 22.4.16: If 0 € G; and tT € Gj, i,j > 1, then 


oto '7* € Gis jar. 


Proof. 
Corollary 22.4.17: For 72> 1, 


ota tr te Gir —> of e€G,orreE Gi4t. 
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Proof. We have 


ota tr te Gur — ota }=T in G;/Gis 
> Hota") = HT) in (n°) /(a*t1) 
<=> 61(7)(05(c)' — 1) =0 by Proposition [22.4.15] 
= § (7) =o tio) =1 
<> TE Gi or 0’ € G4. 


Corollary 22.4.18: jump-muttipie-go/e1 Suppose G is abelian and |Go/G,| {7. Then G; = Gi41. 


Proof. Write Go/G, = (7) where r = |Go/Gj|. Since r { i, o 4 1; for any lift o € Gp of G, 
ao’ ¢ G,. Since G is abelian, we get for all tT € G;, ata !7~! = 1. By the previous corollary, 


noting ao’ ¢ G,, we must have T € Gi41. 


Definition 22.4.19: A jump for L/K is an integer 7 such that 
Gi F Gist. 


Corollary |22.4.18}tells us that jumps are divisible by |Go/Gi|. 


5 Herbrand’s Theorem 


5.1 Functions y and w 


Note that ramification groups behave nicely under taking subgroups (i.e. passing from 
M/K to M/L), by Proposition However, the indices are screwed up when passing 
to quotient groups (i.e. passing from M/K to M/L). We calculate exactly how the index 
changes (Herbrand’s Theorem 12.1}, and use it to define a different numbering scheme that 
is invariant under passing to quotient groups. 

It is important to know how ramification groups behave under quotients because this 
gives a compatible system that allows us to look at larger and larger field extensions, i.e. 
pass to the inverse limit. 


Definition 22.5.1: Define yr/K : Roo > Rso by 


~r/K(u) = I G. ea dt 


recall G,, = Gy,)) and extend yr/xK to Rs_;  Rs_, by 
fu] / > > 


piu) =u, —-l<u<0. 
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This is a piecewise linear increasing function with yr/«(—1) = —1 and with derivative 


at least Gal’ so it is a bijection. 


Definition 22.5.2: Define wz/% : R-1 > R-1 by Yz/K = Or IK: Define the upper num- 
bering filtration by 


G":= Gag) U2 —1. 


5.2 Transitivity of » and w 


The function yr/K gives the reindexing when we pass to the quotient Galois group. 


Theorem 22.5.3 (Herbrand’s Theorem): terbranatnm Let L/K'/K be finite Galois extension 
with separable residue field extension. For all u > —1, 


G,,H/H = (G/B) 


Pryswi(u)* 


Here, G, is the ramification group of L/K and (G/H),, yqi(u) 1S the ramification group 
ot KK, 

We will need several lemmas. First we relate the function ig/#(@) and ig evaluated at 
the lifts of in G. 


Lemma 22.5.4: ignie2 For o € G/H, j(7) = maxcesu ic(o), 
taju(@) — 1 = vr (J(@) — 1). 


Thus applying yr/xK has the effect of “turning” tg into ig/y. By writing out the criterion 
for a € G, or (G/H), in terms of ig and ig/z, respectively, we will get Herbrand’s Theorem. 


Proof. Pick 09 € G mapping to @ such that ig(oo) = j(7). Then by Proposition |22.4.6| 
1 


; 1 , 
herbrand-pfllG/H (c) = S- ig(o) = S> iq(ooT). (22.8) 
CL/K' ceaH CL/K' reH 


We claim that 
te(ooT) = min(te(o0), t¢(7)) = min(7(@), te(7)) 
for all t € H. Indeed, by the nonarchimedean inequality, 
ig(ooT) = vz(oT (8) — 8) 2 min(vz (07 (8) — 7(8)), vz(7(8) — B)) = min(ig(o0), ig(7)). 
Consider two cases. 
1. i¢(t) = in(T) > ig(oo). The above gives 
ig(oor) > min(ig(a0),i¢(T)) > te(ao). 


Equality holds by the maximality assumption on do. 
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2. ig(T) < ig(oo). Then 
ig(ooT) = min(ig(o0),ig(T)) = ie(7). 


The RHS of (22.8) then equals 


the next lemma. 


en Min(ig(oo),ig(T)); the result then follows from 


i 
ensK! 


Lemma 22.5.5: ; 


CL/K o€G 


~i/K(u) = min(j(o),u+1)—1. 


Proof. Since both sides are piecewise linear functions, and both sides equal u for —1 < u < 0, 
it suffices to show their derivatives (slopes) are equal for u > 0. 

If7—1< u<iwherez EN, then the slope of the LHS is (EXEAR For the RHS, since 
ig(c) is an integer, each term is either ig(a) or u+ 1; each term where u+1 is the minimum 
contributes to the slope. Hence the slope on the RHS is 

1 IGi| sl 


Gg €G:ut+1 <ig(o)}/= G0 €G:iglo) =i4+1}|/= = : 
are a(o)}| ora a(c) } ce nl 


as needed. 


Proof of Theorem|22.5.3 We have the following string of equivalences. 
1.¢€G,H/H =G,/GunH 
2. There is o € G lifting 7 so that a € Gy. 
3. je(7) -1>u. 
4. pryKi(Je(F) — 1) 2 Yrjxi(u). 
5. teyu(@) — 1 > pryx(u). 
6.6 €(G/ A) 5, iil: 


We have (3) <= (4) because yz/x« is monotonically increasing and (4) <= (5) by 
Lemma [22.5.4 


Now we prove transitivity for y and w. 


Proposition 22 . 5 6: phi-psi-trans 


PL/K = PK'/K ° PL/K' 
WL/K = WL/K! ° WK'/K: 
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Proof. It suffices to prove the first equation; the first implies the second since y and w are 
inverse. For —1 < u <0 both sides equal u. Thus it suffices to show the derivatives of both 
sides are equal for u > 0. For u ¢ Z, the derivative on the LHS is 


; 1 
Yr/K(U) = (ene 


By the chain rule, the slope on the RHS is 


/ ! (G/F) 0, jcr(u)| | H,,| 
Pex (PL/K(U)) Pr Kx (U) — \(G/H)o| |Ho| 


G,H/H||Ay 
= |GuH/H||Hul by Herbrand’s Theorem [12.1] 
EK'/KEL/K! 


G,/H OG, ||H, 


€L/K 


G,| 
Gol 


using HG, = H,, (Proposition |22.4.4) and multiplicativity of ramification index. The 
derivatives are equal, as needed. 


Finally, we prove the most important consequence of Herbrand’s Theorem: namely, by 
using the upper numbering (i.e. numbering using the inverse of yz/«), quotients of ramifi- 
cation groups are preserved. 


Proposition 22.5.7: For all v > —1, 
GH/H =(G/H)". 


Proof. By Herbrand’s Theorem and transitivity of ~ (Proposition |22.5.6) (Wr/K = 
WL/K! fe) WK'/K); we get 


G°H/H = Gy, n(x) H/H 
= (G/F) 95 cirxv)) = (G/A) vee) = (G/A)E 


We can now define upper numbering for infinite algebraic extensions L/K. 
Definition 22.5.8: Define 


G(L/K)":= im G(K"/K)”. 


K’'/K finite Galois 
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6 Hasse-Arf Theorem 


We have two different filtrations, the lower numbering filtration {Gy }u5-1 and {G°},>-1. 
Definition 22.6.1: A jump is u such that G, #4 G,,- or v such that GY 4 G’*. 


They are the x and y-coordinates of jump points, i.e. where the slope of y changes. 
Note a jump u € Z since G,, = Gy,;. Moreover, u is a jump for the lower numbering iff 
v = y,/K(u) is a upper numbering, because vy, W are monotonically increasing. 


Theorem 22.6.2 (Hasse-Arf Theorem): If G is finite abelian, then the jumps v are integers. 
In the cyclotomic case, G was abelian. 

Remark 22.6.3: There is a nonabelian example where v ¢ Z. (See HW.) 
We postpone the proof. Applications. 
1. Used in local class field theory. 


2. “Conductor of Galois representations” are in Z, not just in Q. Finite L/K, G(L/K) > 
GL, (C). 
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Geometric algebraic number theory 


zant In this chapter we answer the following two questions. 


1. Suppose, for every place v, we are given positive reals a,, all but finitely many of them 
equal to 1. How many elements of x € K satisfy 


[tly < a 
for every v? 


2. Given a generalized ideal class R (to be defined) and a number L, how many ideals 
a € RK satisfy Sta < L? (What are the asymptotics as L — oo?) In particular, how 
many integral ideals satisfy Na < L? 


The first question is known as the Riemann-Roch problem for number fields, because it is 
analogous to the Riemann-Roch problem in algebraic geometry!: Given a curve C, and an 
integer ap for every point (all but finitely many equal to 0), what is the dimension of the 
space of functions f with 

ord,(f) > —ap 


for every P? (ord,(P) is the “order” of the zero of f at P.) 

The second question is important because the answer will appear again when we define 
L-functions (because L-functions involve a sum over all ideals). This will allow us to get 
“explicit” formulas for quantities of interest (class number, regulator). And because it’s not 
much of a detour, we might as well answer the first question as well. 

Our technique will be similar to that used in Chapters [16] and 


1 Generalized ideal classes 


Also talk about adeles and stuff. 


'We will not attempt to draw a parallel in our discussion. The reader interested in seeing the correspon- 
dence should consult Neukirch [Neu99]. We follow Lang [Ean94], Chapter 6. 
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Proposition 23.1.1: geniaciass We have the following diagram 


where each square 


I(c) —— I 
K(c) P(c) ie 
| 
U —>+UK,—— P. 
i. 
U. —— K, 


- 


means C = y~1(D) and A/C & B/D. 


Proposition 23.1.2: The group of c-ideal classes has order 


_ 2 TI). Tp) (1 = a) 
: [U : U,] 


where r(c) is the number of real places dividing c. 


We will define the totient function by y(c) = 2’ TI pico Mp) (1 _ it) 


2 Counting lattice points 


Definition 23.2.1: A subset TC 


number of Lipschitz maps y; : [0, 1|* + T whose images cover T. 


Theorem 23.2.2: Let LC 


R’ a lattice with fundamental domain F and D C 


R is k-Lipschitz parametrizable if there exist a finite 


R™ a subset 


whose boundary is (N — 1)-Lipschitz parametrizable. Then 


|{e €L:a2 €tD}| = Vol(D)Vol(F)t® + O(¢*~"). 
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3  Riemann-Roch problem 


4 Asymptotics of generalized ideal classes 

Definition 23.4.1: For a generalized ideal class  € I(c)/P,, let 
j(K, t) = {ae RK: Na<t}. 

Theorem 23.4.2: 


2" (2m)* Ry 


(A) =e 
j(K, t) eae 
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Chapter 24 


Class Field Theory: Introduction 


intro-cft We give the main theorems of class field theory, deferring the proofs to the next five 
chapters. In this chapter we’ll focus on the motivation and intuition behind the theorems. 
The reader may find it helpful to read this chapter along with Chapter [29| Applications. 

In Section [1] we'll introduce the Frobenius map, which we need before we can state the 
theorems of class field theory. In Section [2] we state the theorems of local class field theory. 
We state two formulations of global class field theory: using ideals in Section |4} and using 
ideles in Section [6] after giving the relevant background on ray class groups and ideles. The 
formulation using ideals is less sophisticated to understand, but the formulation using ideles 
is more useful theoretically. We'll compare the two formulations in Section Finally, 
we'll present a proof of the Kronecker-Weber Theorem using class field theory in Section 
Throughout, we’ll refer back to the cyclotomic case, because class field theory is easy to 
understand in this case, and it already shows much of what’s at play. 


1 Frobenius elements 


sec:frobenius-elements In order to define the Artin map and state the main theorems of class field 
theory, we first need to understand the Frobenius map. This map takes prime ideals inside 
a field K to automorphisms in a Galois group G(L/k). One reason for studying the Frobe- 
nius map is that Frob;/«(p) gives information on how the prime ideal p splits in a Galois 
extension. First, we’ll define the Frobenius element and explain what it tells us about the 
splitting of primes. Next, we’ll look at the example of a cyclotomic extension, which suggests 
that something deeper is going on with the Frobenius map, which we'll attempt to explain 
with class field theory. 

The reader may wish to review Section on the decomposition and inertia groups. 

The results in this section will apply to both local and global fields. 


Definition 24.1.1: toveniusciement Let L/K be a Galois extension with Galois group G, and 
assume that the residue field k is finite. 
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1. Let $8 be an unramified prime of L. Define the Frobenius element 


Frobp«(®) = (PB, L/K) 


to be the element o € Dz /«(§B) C G(L/K) that acts as the Frobenius automorphism 
on the residue field 1 = @,/8 fixing k = @x/p. In other words, letting k = Fy, 


oa =a?! for alla € l. 


2. Let p be an unramified prime of kK. Let S8 be any prime dividing p, and define 
Froby/«(p) = (p,L/K) to be the conjugacy class of (8,L/K). Equivalently (see 


lemma |24.1.2), 
Frobrx(p) = (p, L/K) := {(B, L/K) | Pip}. 


In the local case, when there is only one prime, we will simply write Frobz/x. 


Proof of existence of (98, L/K). When p is unramified in L, [(98) = 1 so from Corollary}15}15.7.6} 
the map Dz/x« (8) > G(I/k) is an isomorphism. Thus there is a unique element of Dz /«({B) 
whose image is the Frobenius element. 


To show the above definition is valid, we need to show that changing the prime above p 
corresponds to conjugating the Frobenius element. 


Lemma 24.1.2: lem:frob-lem Let T E G(L/K). Then 


D(rB) = tD(P)r™ 
(TB, L/K) = 7B, L/K)r™. 


Therefore (since G(L/K) operates transitively on the primes dividing p), the conjugacy class 
of (PB, L/K) is equal to {((P, L/K) | Pip}. 


Proof. The first statement follows from the fact that if G acts on S and G is the stabilizer 
of s € S, then tGt7! is the stabilizer of ts. Recall that the decomposition group D() is 
defined as the stabilizer of 8. 

For the second statement, let g = |k| and note that 7, as an automorphism, preserves 
qth powers. Hence for all b € @,, 


(7(P, L/K)r~")(b) = r(7*(a)?) = a? (mod 7(P)). 


Note that if G is abelian, then the conjugacy classes are just elements, so we can think 
of (8, L/K) as an element of G(L/K). 

One of the most basic applications of the Frobenius map is to the splitting of primes in 
an extension. 
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Proposition 24.1.3: tob--spiit-compictey Let L/K be an extension of degree n, unramified at 
3B |p. Then p splits into WGRL/K)) factors, where ((98, L/K)) is the subgroup of G generated 
by (B, L/K). 

In particular, p splits completely iff (p, L/A) = 1. 


Proof. Let | and k be the residue fields. 

The Frobenius element generates the decomposition group D(8), since it acts as the 
Frobenius automorphism on //k and D(B) = G(l/k). Hence |D(38)| = | ((p, L/K)) |. Since 
p is unramified in L, e(B/p) = 1 and f(%/p) = |D(B)| = | ((p, L/K)) |. Hence, letting g be 
the number of primes above p, we have 


n=([L: K]=e(P/p) fOB/P) g- 


— 
1 ((B,L/K))| 
Then 
n 
9 = Wiles TET? 
(OB, £/K))| 
as needed. 


In particular, p splits completely iff g = n, iff |((B, L/K))| = 1, iff |((p, 2/K))| = 1, ie. 
the Frobenius element (p, L//) is trivial. 


Next, we'll need a result of how the Frobenius element changes as we change the base 
field. 


Proposition 24.1.4: pr-irob-base-exs Suppose that L/K is an unramified Galois extension, K C 
K' CL, and p is a prime of K’. Let k,k’ be the residue fields of K and K’. Then 


Frobz/x/(p) = Frobrx(p)*™) 


Note by taking the [k’ : k]th power we mean that if Froby/x(p) is the conjugacy class of 
g, then Frobr/x(p) "I is the conjugacy class of o/*l, 


Proof. By definition, the left hand side induces the |k’|th power map on J, while the right 
hand side induces the |k| - [k’ : k]th power map on /. Hence they are equal. 


1.1 Examples 


We calculate the Frobenius map explicitly in two examples. First, a warm-up. 


Example 24.1.5: For the field extension Q(i)/Q, 


{ complex conjugation, p=3 (mod 4), 


(p, Q(2)/Q) = 1 p=1 (mod 4). 
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Proof. If p= 3 (mod 4), then p remains prime in Q(i). Reference The residue fields are 


I = Z[i]/pZ[i] = Fy 


k =Z/pZ =F,. 


Now (p, Q(z) /Q) must induce the pth power map on ¢ = F,2. Since this is not the identity, 
it must be the only element of G(Q(i)/Q) that is not the identity, i.e. complex conjugation. 
(This does act as the pth power, since recalling p = 3 (mod 4), (a+ bi)? = a? + bPi? = a— bi 
(mod ).) 

If p = 1 (mod 4), then p splits in QJi], say into $B; and By where $B), PB. are complex 
conjugate. Then Z|i]/%8, = Z{i]/B2 = Z/pZ = F, so the extension of residue fields is 
trivial and the Frobenius automorphism is trivial. It is induced by the identity map, so 
(p, Q(7)/Q) = 1. (Note that in this case the decomposition group is trivial and does not 
contain complex conjugation.) 


We generalize the above example to cyclotomic extensions. 


Example 24.1.6: cyctotomic-frobenius Let K = Q(¢,) where ¢,, is a primitive nth root of unity. 
Then G(K/Q) © (Z/nZ)* by identifying k € (Z/nZ)* with the automorphism sending ¢, 
to ¢* (Proposition |19.3.1). 

Suppose o := (p,L/K) is the map ¢, + ¢*. By definition o reduces to the pth power 
map on the residue fields, so o(¢,) = ¢? (mod p@x). Hence 

Ch=Gi (mod p@x). 

But since p { n, the nth roots of unity are distinct modulo p. (More precisely, they are 
distinct elements of Fm where m is such that p™ = 1 (mod n).) Hence we must have p = k 


(mod n), i.e. o is the pth power map. 
This shows that for a prime p{n, under the identification G(K/Q) = (Z/nZ)*, we have 


(p, Q(Gn)/Q) = p mod n. 


This calculation of the Frobenius elements gives a complete characterization of how 
primes split in cyclotomic extensions. We obtain a simple proof of Theorem |19}19.2.4) which 
we restate here. 


Theorem 24.1.7: thm:cyclotomic-factorization-p-2 Suppose that n= pm, where Pp uf m. Let 


} =0rd,,4(p)- 


Then the prime factorization of (p) in Q(¢,) is 
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where ‘8; are distinct primes, each with residue degree f over Q, and g = ru) 


In particular, 


(p) splits completely in Q(¢,) iff p=1 (mod n). 


Proof. For r = 0, i.e. n =m, the automorphism ¢, +> ¢? has order ord,,(p), so the result 


follows from Example|24.1.6/and Proposition |24.1.3| For r > 0, note that (p) totally ramifies 
T9|19.2.2 


in Q(¢,-) by Proposition , and Q(¢,,) is the compositum Q(¢,r)Q(Gn). 


1.2 The Frobenius map is a nice homomorphism 


sec:frob-map-nice Because we’ve defined the Frobenius map on prime ideals p unramified in L, and 
the prime ideals are a free basis for the ideal group, we can extend the Frobenius map to the 
subgroup of ideals generated by unramified primes. Denoting this subgroup by [?, we have 
a map 

catrob-extend Froby x : I 4 G(L/K). (24.1) 


What is nice about this map? Look back to the cyclotomic case, Example |24.1.6, The 
Frobenius map didn’t map the primes arbitrarily; it sent p to p (mod n). What’s to note 
here is that (p, Q(¢,)/Q) only depends on p (mod n), information about p intrinsic to Q, 


even though (p, Q(¢,,)/Q) tells us about the field extension Q(¢,,)/Q. Thus (24.1) factors: 


Frobr/g 


G(L/Q) (24.2) 


Le 


I8/Ig(1, con). 


eq:frob-for-cyclo I 


Here, [3 denotes the prime ideals relatively prime to n and Jg(1, oon) denotes the subgroup 
of ideals generated by (p) with p=1 (mod n) and positive. 

Something like this in fact happens in general: global class field theory tells us that for all 
abelian extensions, the Frobenius map “factors through a modulus,” that (p, L/ A’) depends 
only on what p is modulo a nice subgroup of ideals in K. Our example essentially proves 
class field theory for cyclotomic extensions of Q, by using the roots of unity to “keep book” 
on the action of Frobenius. Don’t be deceived, though, the general case is much harder. 

Before we look at global class field theory, we first study local class field theory. Since 
there’s only one prime in a local field, rather than consider a map from the (rather boring) 
ideal group, we consider a map from the field itself. 


2 Local reciprocity 
sec:local-reciprocity When Kt is a nonarchimedean local field, there is a single prime ideal p = (7). 
For every abelian unramified extension, the previous section gives an element of G(L/K) 


corresponding to p, which we can think of as corresponding to 7. 
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The main theorem of local class field theory is that we can extend this map to all elements 
of K™, and get elements in Di a ice oe G(L/K) = G(K®/K). We will also show that 


this map behaves well under restricting to subextensions L/K. 


Theorem 24.2.1 (Local reciprocity law): tocat-reciprocity For any nonarchimedean local field K, 
there exists a unique homomorphism 


bn: K* + G(K®/K), 
called the local Artin (reciprocity) map with the following properties. 


1. (Relationship with Frobenius map) For any prime element 7 of K and any finite un- 
ramified extension L of K, ¢x«(z7) acts on L as Froby/« (7). 


2. (Isomorphism) Let pz; be the projection G(K??/K) > G(L/K). For any finite abelian 
extension L/K, ¢x induces an isomorphism @,/« : K*/Nmyz/«(L*) > G(L/K) mak- 
ing the following commute: 


Kx ——**_5 G(K*>/K) 


PL 


8 G(L/K). 


~ 


K*/Nmz/x«(L*) 


3. (Compatibility with norm map) For any Kk C Kk’, the following diagram commutes. 


K™ Ox! G(Kl/K" 


Nm x /K @| ab 


K* °* , G(K*/K) 


We can also say something about this map topologically. 


Definition 24.2.2: A norm group is a subgroup of K™ of the form Nmz/«(L*) for some 
finite extension L/K. 

Let Frob denote the Frobenius element of |/k. The Weil group W(L/k) of an extension 
L/K is equal to the inverse image of Frob” under the map G(L/K) > G(I/k). The topology 
on W(L/K) is the topology from considering it as a disjoint union of cosets [(L/K)o,, where 
Oy is any lift of Frob”. 


Note that the topology on W(K@>/K) as defined above is strictly finer than the topology 
it inherits from G(K*?/K) (see exercise 2.1). 


Theorem 24.2.3 (Local existence theorem): tocatexistence Let A be a nonarchimedean local 
field. The norm groups of K are exactly the open subgroups of finite index. 
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Theorem 24.2.4 (Topological isomorphism for LCFT): tnm-ict-topoicey The image of the Artin 
map is the Weil group W(L/k), and $x gives an isomorphism of topological groups K* — 
W(L/Kk). It restricts to an isomorphism Ux — I(L/K). 


Combining Theorems |24.2.1] and |24.2.3] gives the following bijective correspondence. 


Theorem 24.2.5: icé-correspondence Let K be a nonarchimedean local field. Then there is a 
bijective correspondence between finite abelian extensions of K and the set of open subgroups 
of finite index of K*, given by 


Dw Nmz/x(L”). 


Furthermore, this is an inclusion-reserving bijection that takes intersections to products and 
products to intersections: 


Nmyz.1/K((L ° i) = Nmz/x(L*) a) Nmyz//«(L'*) 
Nmypnr/K((L al L)\*) = Nmz/x«(L*) ° Nmy/x(L’*). 


Finally, every subgroup of K~ containing a norm group is a norm group. 


The following gives a sort-of converse statement: nonabelian extensions cannot be de- 
scribed by norm groups. 


Theorem 24.2.6 (Norm limitation theorem): tim:lctt-norm-timitation Let L be a finite extension 
of a local field K, and K’ be the largest abelian extension of K contained in L. Then 


Nmz/x«(L*) = Nmqi/x(k"). 


3 Ray class groups 


sec:ray-class-groups In order to define the Frobenius element of a prime we need the extension to 
be unramified. However, when K is a global field, we cannot as easily say an extension L/K 
is “unramified,” because Gx has many prime ideals. Requiring that L/K to be unramified 
at all primes of K is too restrictive, because most fields L do not satisfy this condition. 

Thus, we instead focus on a set of primes S and consider extensions L/K that are 
unramified outside of S. When we define Frobenius elements, we have to exclude S, and 
when we define a reciprocity map we have to exclude the subgroup that these primes generate. 
(Note that unlike in local reciprocity, we will not define ¢« with domain K*, but rather 
with domain a subgroup of the ideal group Ix.) 

Letting S range over all finite subsets, we will account for all finite abelian extensions 
L/K, because each extension is ramified at only finitely many primes (Theorem [15]15.6.1). 

This motivates the following definition. 
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Definition 24.3.1: i... Let I be the group of fractional ideals of K. Define I? to be the 
subgroup of [x generated by prime ideals not in S. 

Let L/K be an extension of K. Define I? := I?’, where S’ is the set of prime ideals lying 
above a prime ideal in S. 


Note that if S C T then I? D If. 
Similar to Theorem |24.2.1} global class field theory will tell us there is a map 


Te/ Nmzx(Ip) + G(L/K) 


when S contains the primes that ramify in L. However, this is not an isomorphism until we 
take a further quotient, namely, the quotient with a subgroup of principal ideals Px (1, m), 
which we will define. First we need the following. 


Definition 24.3.2: A modulus m is a formal product of places of K, where 
1. Finite primes have exponents in No, and only finitely many exponents are nonzero. 
2. Infinite real places have exponents 0 or 1. 
3. Infinite complex places do not appear. 


We say a place divides m if it appears with positive exponent. We write 


m= J] p® Tp om. 


p finite v real 
—S_EE——sernarreroeesn 
mo Moo 


In other words, a modulus is the product of a proper ideal with some number of real 
places. 


Definition 24.3.3: Let S(m) denote the set of finite primes dividing m. 
Define K(1,m) (“elements of K that are 1 modulo m”) to be the subgroup of elements 
of K* satisfying the following. 


fee —1)>m/(p), finite p | m 


te > 0, real v | m. 
Let i: K* — Ix be the map sending a to (a), and let 
Pr; m) -= i7(K (1, m)). 
Define the ray class group of m to be 


Cx(m) = Te /Pxe(1,m). 
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Note that Px(1,m) € ce because if a € K(1,m) and p € S(m), then ord,(a — 1) > 1 
and ord,(a) = 0, ie. pt (a). We will often abbreviate 1°) as I™, 


Example 24.3.4: cxray-ciassgrours If m = 1 then Px(1,m) is the subgroup of principal ideals 
and C'x(m) is just the ideal class group. 
If m = TI, rea V, then 


Cx(m) = Ix/ {(a) € Ix : ay > 0 for all real v} 


is called the narrow class group. We are only modding out by the “totally positive” 
principal ideals, so it is larger than the class group. 


Definition 24.3.5: A congruence subgroup for K modulo m is a subgroup H such that 
Px(1,m) CH CIR”. 


The corresponding generalized ideal class group is oe /H. 

We will show that generalized ideal class groups are exactly the Galois groups of abelian 
extensions of K. 

Finally, in preparation for the global reciprocity theorem, we say what it means exactly 
for a map to only depend on modulo conditions, like the Frobenius map we considered in 
Section 


Definition 24.3.6: A homomorphism wy : [5 + G admits a modulus if there exists a 
modulus m with S(m) = m such that w factors through I°/Px(1,m). In other words, there 
exists a modulus m with S(m) = S such that 


(Pr(1,m)) = 0. 


4 Global reciprocity 
sec:global-reciprocity In this section K is a global field. 


Theorem 24.4.1 (Global reciprocity theorem): giobatreciprocity Let L/K be a finite abelian 
extension. Let S be the set of primes ramifying in L. There is a unique map ~z/«K such 
that for a prime ideal p ZS, Wr/x(p) acts on L as Frobz/x«(p). Moreover, wz/x« satisfies the 
following properties. 


1. (Isomorphism) wz/x% admits a modulus m with S(m) = S and wr/K induces an isomor- 
phism 
bapa 1S /(Pxe(1,m) “Ninzc(I5)) 3 G(L/K). 
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2. (Compatibility over all extensions) Suppose S C T, and L/K, M/K are finite abelian 
extensions such that Z C M and such that the set of primes ramifying in L, M are con- 
tained in S,7, respectively. Then the following commutes, where pz is the projection 
map. 


i “’G(M/K) 


|p 


13 “VG(L/K). 


3. (Compatibility with norm map) For k C Kk’ C LF, the following diagram commutes. 


Efe 


yp 
18, “¥G(L/K’) 


[sme [ 


1. "“/Q(L/K) 


Remark 24.4.2: rem:gctt-ideais The uniqueness of wy/« is clear from the fact that I? is freely 
generated by prime ideals. Part 2 follows immediately from the definition of ~z/K and ~u/K, 
and part 3 follows immediately from the existence of wWr/~% and Wr/K, as we show below. 
The crux of the theorem is part 1. 

For part 2, since primes generate I>, it suffices to show that for any prime p € I>, 


Wr/K(P) = pt(Wu/K(P))- 


But by definition, the left-hand side is Frobz/x(p) and the right-hand side is py (Frobyx (p)). 
Now py induces the map G(m/k) — G(l/k), so both sides act on k as the |k|th power 
Frobenius, and are equal. 

For part 3, we need to show for any prime p € [?,, 


Wryw (P) = Ure (Nm K(P)). 


But by definition, the left-hand side is Frob,x/(p) and the right-hand side is wy/«(p'"™!) = 
Frobz«(p)**l. The result now follows from Proposition |24.1.4 


Example 24.4.3 (Cyclotomic extensions): ex:cyciotomic-get In Section [1.2| we showed that the 
global reciprocity theorem (part 1 above) holds for a cyclotomic extension Q(¢,)/Q. In- 
deed, letting m be noo, we have that J™/Px(1,m) > G(Q(¢,)/Q) as in (24.2). (Note that 
Nmz/x (I?) C Px(1,m) will follow from the first inequality |28]28.2.1}) 


Note the modulus in Theorem [24.4.1|has to be divisible by all primes ramifying in L, and 
the primes have to have large enough exponents for ker(wWz/«) to be a congruence subgroup 
modulo m. There is a canonical choice for m, namely the modulus with least exponents. It 
is called the conductor of the extension L/K, and denoted by f(L/K). 

We have the following analogue of Theorem [24.2.3] 
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Theorem 24.4.4 (Existence theorem): giobarexistence Let H be a congruence subgroup modulo 
m. Then there exists an abelian extension L/K such that 


H= Px(1,m) ‘ Nmz/« (J?) = ker(qz/K). 
In particular, this applies when H = Px(1,m). 


Definition 24.4.5: afray-ciass-icia For each modulus m there is a field Ky, called the ray class 
field of A modulo m such that qx,,/« defines an isomorphism 


Cx(m) > G(Kn/K). 


Example 24.4.6: Since oo(n) is the smallest modulus such that wevc,)/@ factors through 
Tt /P(1,m), co(n) is the conductor of Q(¢,,). Since we actually have an isomorphism 


Cx (con) = 1%" /Px(1,00n) = G(Q(G,)/Q), 


Q(¢,) is in fact the ray class field of oo(n). 
We have that Q(¢, + ¢7') is the ray class field of (n) (see exercise 1.2). 


Putting this all together, if we fix a modulus m we have the following bijection between 
extensions and subgroups. 


Theorem 24.4.7: thm:ray-class-bijection Fix a modulus m and a global field K. The map L tH 
Nmy/«(Cz(m)) is a bijection between 


1. the set of abelian extensions of K in the ray class field Ay and 
2. the set of subgroups of Cx (m). 
Moreover, it reverses inclusions and switches products and intersections: 
LOM <=> Nmz;x«(Cr(m)) > Nmyx(Cu(m)) 


Nig oR VC cio) = Nm_z,/«K(Cz,(m)) M Nmz,/K (CL, (m)) 
Nmyp,nt2/K(Ciinta(m)) = Nmz,/«(C1,(m)) - Nm(Cz,(m)). 


Note Theorem is like Theorem except that we’re only working with finite 
extensions L/K instead of putting them together into K®/K. We cannot combine the maps 
wr because they are defined on different groups. Hence we now take a different approach, 
using ideles. 


5 Ideles 


sec:ideles In this section we give an alternate statement of the main theorems of global class 
field theory. 
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In local class field theory, we had isomorphisms K™/ Nmy x (L*) = G(L/K). For this to 
be true, Nmz/«(L*) must have finite index in K*. However, this is no longer true when K 
is a global field. (If A is local, it is complete with respect to a valuation, and Nmz/x(x) = y 
has solutions in y for many x, in the same way that Hensel’s lemma often gives solutions 
over complete fields. ) 

We want to work with complete fields but AK comes with a bunch of different places. The 
solution is to complete K at every place at once and combine the information into the adele 
ring and idele group. Then we will get statements for global class field theory that resemble 
local class field theory, with A replaced by Cx, a group related to the idele group (to be 
defined). 


Definition 24.5.1: Abbreviate @, = Ox,. The adele ring of K is 


( ) 
Ax = 1 (ae) E Il Ky, : a, € @, for all but finitely many am . 


vEVK 


We write this as TT, cy, (Mv, Gy). Equip it with a topology by letting a basis for open sets be 
II, Uv, where U, is open in K, for all v and U, = @, for almost all v. In other words, it is 
the unique topology from which [],, @, inherits the product topology and is open. 

The idele group of K is the group of units of the above: 


( ) 
x =Axke= JI Ee, oo \= 1 (2) € |[ Kx: a, € G for all but finitely many a 


vEVK vEVK 


Equip it with a topology by letting a basis for open sets be J],, U,, where U, is open in Kk; 
for all v and U, = G@* for almost all v. In other words, it is the unique topology from which 
II, Gx inherits the product topology and is open. 


Be careful: the topology of the idele group is not the subspace topology induced from 
the adele ring. 


Definition 24.5.2: For a finite set S containing all infinite places, let I} = T],<g KX x 
Tl.gs @. In other words, Iz contains those ideles that are units away from S. Give Iz the 
subspace topology inherited from Ix. 


Note the topology on I?- is just the product topology, and that I = Ug IP. 
K sik 


Proposition 24.5.3: I?. is locally compact. 


Proof. TIyes Ky is a finite product of locally compact spaces; I],¢5 @; is a product of com- 
pact spaces (Proposition |21]21.1.3) so compact by Tychonoff’s Theorem. Since a finite 
product of locally compact spaces is compact, the result follows. 
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Think of the ideles as a thickening of ideals: it includes factors for infinite places, and 
includes units at finite primes. We can embed K™ via the diagonal map, and Kk; via the 
inclusion map. 


Definition 24.5.4: Define i: K © Ax by the diagonal map i(a) = (a,a,...) and i, : 
Ky © Ax by the inclusion map i,(a) = (1,...,1, a, ,1,...,1). Also denote by 7,7, the 
Ww 


UV 


maps restricted to K* 4 Ig and i, : K, © I. 


Proposition 24.5.5: pricaiscrete (*) is discrete in Ix. 


Proof. Given a € K™%, let S be set of places contining the infinite places and the finite places 
where u(a) # 0. Consider the open set 


U={x elk: |x, —al, <¢ forv € S, x, € U, for v ZS} 
containing i(a). If i(b) € U with a F b, then 


[[lo-eh <el <1, 


contradicting the product formula Hence i(K*) NU = {i(a)}. 


Definition 24.5.6: The idele class group is defined to be 


Cr= n/K”, 


where K* is thought of as a subgroup of Ix by the diagonal map 7. 


We define a norm on adeles by defining it componentwise. 


Definition 24.5.7: The norm, from L to K is the function Nmj;/« : Ay + Ax defined by 


WU 


Nmp/K ((w)wev,) = (I Nine) 
vEVK 


This descends to a function Nmyz/x : I, > I. 


5.1 Ray class groups vs. ideles 


We will need the following to go between the interpretations of global class field theory via 
ray class groups and via ideles. The statement in terms of ray class groups is easier for 
concrete applications, but the statement in terms of ideles is better abstractly, and more 
convenient to prove. (But to complicate things more, certain parts of the proof will be easier 
to think of in terms of ray class groups.) 
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We can go from Ix — I easily, via the map 


eciidele-to-ideat(@) = Il peer) (24.3) 


v=Up finite 


(also denoted simply (a)). However, if we want the image to be in ree we need to focus our 
attention on a subset of ideles I(1,m) (defined below). Taking the map Ix (1,m) > 12!" 
and modding out by appropriate groups then makes it a bijection. We also need to check 
that we don’t lose anything when we consider only ideles of the form Ix(1,m); that is, that 
the inclusion Ix(1,m) © Ix is a bijection, again after modding out by appropriate groups. 
This is Proposition [24.5.9] below. 


Definition 24.5.8: armoreiacieats For a place v | m, define 


( Reo, v real 
in), = ; 
1+ p™), v= wy finite. 
Let @* be the group of units of K,. (For v infinite, OX := K%). Define 
eq:iklm. K(1, m) = Il I(m) « TI (KS 0%) (24.4) 
vim vfm 
Ux«(1,m) = [[Z(m), x [[ & 
vim ofm 


K(1,m) = i(k*) OI1g(1, m). 
Let Ux — UK(1, 1). 
Compare (2 to the definition of Px(1,m). 


Proposition 24.5.9: pr-idete-ray-ciass We have the following maps. 


K(1,m)/K(1, m) 


aa 


k/K* =Cx 


x(1,m)/K(1, m)Uq (1, m) — +Cx(m). 


The bottom map is induced by the map p : Ix — J? and the top map is induced by inclusion. 
Moreover, for any finite Galois L/K such that 


Ux(1,m) C Nmz/x (Iz), 


this diagram induces isomorphisms 


K(1, m)/[K” Nmp/K LO K(1,m)| = K/K* Nmyp/K 


el 


IIe 


TR/(Px (l,m) -Nmzjx (U7). 
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Proof. For the bottom map, consider the exact sequence 


03 K* AIx¢(1,m) = K(1,m) 5 Ix(1,m) 3 O™ 5 0. 


We have that I¢(1,m)/K(1,m) = coker i, so we use the kernel-cokernel sequence.' We have 
ker p = Ux(1,m), and cokerpoi = I°™/p(K(1,m)) = [5 /Pr(1,m) = Cx(m), so this 
gives the exact sequence 


Ux(1,m) > Ig (1, m)/K(1,m) > Ce(m) > 1, 


which gives the bottom isomorphism. 

The ee wal is clearly injective. For surjectivity, take a € Ix. By the weak approximation 
unporere | there exists b so that * € p™) +1 for every v = Up dividing m. Then 
+ €Ix(1,m), and its image varies in Ix ae a by the constant factor b € kK”. 

Now we chow the second diagram. (Warning: this proof is not very Silightentny) Let 
p and p’ denote the maps Ix, — Ig and Ix(1,m) > I?, respectively. Note that the first 
diagram gives isomorphisms 


«(1,m)/((K* Nmz x Iz) 1x (1, m)) —— x/K* Nmzx In 
«(1,m)/K (1, m)Ug(1, mp! (Nz x (I) ———> IR /(Pr(1, m) - Nmze(IP))- 
We have that 

K(1,m)Ux(1, m)p” *(Nmzj«(IP)) (24.5) 
emt = A(1,m)Ugi,mjp *((pfe/r) |wlu¢S)), fu = residue degree (24.6) 
eq:irc2 = K(1, m)UK(1, m)( K(1,m) a) Ux Nmzx( L)) (24.7) 
eq:irc3 = Ux (1, m)( K(1,m) a) (k* Nmy/K i) (24.8) 
eq:iired = (ik Nmy/K ie) a) «(1,m). (24.9) 


(24.6) follows from the fact that if $B | p, then ee = p/(¥/")_ To go between (24.6) 
and (24.7), note that pf(@/”) = p(Nmz/x(1,...,1, tw, 1,---,1)), and that ker(p) = Ux. 
er 


Now we go between (24.7) and (24.8| (24.8). For “C,” suppose a € Ux and 6 € Nmz/x (Iz) such 
that aNmy;x% b € ie Suppose c agrees with a for every v | m, and is 1 everywhere 
else. Then ac7! € Ux(1,m) CI«(1,m). Since aNmy/x 6 € I¢(1,m) as well and Ix (1, m) is 
a group, we must have cNmz/x 6 € Ix(1,m). Hence 


= = 
aNmy/xKb= ac cNmz/x b . 


a 
EU (LM) ep, (1m)n(KX Nmz/x In) 


1Given AS BS C, there is an exact sequence 
0 — ker f — ker go f — ker g — coker f + coker go f — coker g > 0. 


This is proven using the snake lemma. 
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as needed. Furthermore note K(1,m) CI«(1,m)N.A* Nmy x I,. For “3,” suppose a € Kk” 
and b € Nmz;x (Iz) such that aNmz/x b € Ix(1,m). By weak approximation, take c € kK” 
sufficiently close to i with respect to v, for every v € m, so that Nmz/x«(cb) € Ix(1,m). 
Then a Nmz/x«(c~') € Ix(1,m) as well, and in fact in K(1,m). Then 


aNmy/x b =aNmyz/x(c"') Nmypjxcb 
€K(1,m) €lx« (1,m)ANmz/x Ip 


as needed. 
The last step (24.9) follows from the assumption on m. 


Example 24.5.10: cx:ciass-group-idele-quotient Recall how we realized the class group and narrow 
class group as ray class groups in Example We now realize them as quotients of the 
idele class group. 

Take m to be 1. Then the bottom map gives an isomorphism 


Kk/K*Ux = Ck 


where Cx is just the class group of K. This realizes the class group of K as a quotient of 
the idele class group. 
In general, for any modulus m, 


K/K*UK(1,m) & In (1,m)/K(1, mUK(1,m) = Ce(m). 


This realizes the ray class group modulo m as a quotient of the idele class group. 

In particular, m = 1 was the case above. Taking m = J], ya, U, Px(1,m) is the group 
of principal ideals generated by totally positive elements (also written P;?) and Ux(1,m) = 
T]y rea Reo X IL, GX. This realizes the narrow class group of K as a quotient of the idele class 
group. 


Remark 24.5.11: :m-sramity The condition on m in Proposition |24.5.9] was that Ux(1,m) C 
Nmyz/« (Iz). We claim that we can always choose such m, such that S(m) consists of exactly 
the primes ramifying in L/K. 
The condition Ux (1,m) C Nmz (Iz) says that OF C Nmypvo/x,(L”) for all v { m and 
I(m),y GC Nmyp/x,(L”) for all v | m. Now note the following. 
1. If L/K is unramified at v, ie. L°/K, is unramified, then 
Nmz»/xK, i) = lh": Kuld Gx ») ie 


UV 


This is a consequence of local class field theory (Example |27]27.5.1). 


2. Nmy»/x«,(L”) is an open subgroup of K, (this is the easy direction in Theorem |24.2.3) 
and U\) := 1+ 2"@, is a neighborhood base of 1 in Ky. 


By item 1, m doesn’t need to include the places where L/K is unramified, and by item 2, for 
all ramified v we can choose the power of v in m large enough to force U C Nmzpu/x,(L’*). 
Then we will have Ug(1,m) C Nmz/«(Iz). 
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6 Global reciprocity via ideles 
sec:global-reciprocity-via-ideles We now state global reciprocity in terms of ideles. 


Theorem 24.6.1 (Global reciprocity, ideles):  tim:giobat-reciprocity-idetes Given a finite abelian ex- 
tension L/K, there is a unique continuous” homomorphism ¢;/x that is compatible with 
the local Artin maps, i.e. the following diagram commutes?: 


pe Ca) 


Kx » G(L/K,). 


Moreover, ¢,/x satisfies the following properties. 


1. (Isomorphism) For every finite abelian extension L/K, ¢x defines an isomorphism 


bry: Cx/ Nmzjx(Cr) = Ix/(K* -Nmzx(It)) 3 G(L/K). 


2. (Compatibility over all extensions) For L C M, L, M both finite abelian extensions of 
K, the following commutes: 
G(M/K) 


OM/K 
PL 


~¢—F OR) 
Kk as amap Ix > G(K?»/K). 


Thus we can define ¢x% := Lin 1 Sacra EY 


3. (Compatibility with norm map) ¢x is a continuous homomorphism Ix > G(K?/K), 
and the following commutes. 


1 —= 4 G(T /5) 
[seus [eto 


es G(K®>/K) 


Note that in the local reciprocity theorem |24.2.1| the “compatibility over all extensions” 
was automatic when we declared the existence of dx : KX —+ G(K??/K). We stated the 
global reciprocity theorem a bit differently, in the above fashion for easy comparison with 


global reciprocity in terms of ideals |24.4.1 


2G(L/K) is given the discrete topology. 
’This implies that if v = vp is unramified in L, then $7/K(i»(t)) = Frobz/x(p). Global reciprocity is 
sometimes phrased in this way, though the phrasing using the local map gives a bit more information. 
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Remark 24.6.2: rem:sen Uniqueness and existence of d,/« is easy, and parts 2 and 3 are easy 
given the existence of @,. The crux of the theorem is again part 1. 

For uniqueness, note that the ¢,/x is determined by its action on Ky, since for x = (zy), 
we must have 


brK(x) = J] o(2.). 


vEVK 


(The product is Cauchy in the topology of Ix.) This does define a continuous map on Ix 
because ¢,(%,) = 1 whenever x, € @% and v is unramified, and this happens for all but 
finitely many v. 

Parts 2 and 3 follow from the corresponding statements for local class field theory (see 
Theorem 24.2. 1]and the paragraph above this remark), by how @ is defined to be compatible 
with the local maps. 


The idele version of global reciprocity allows us to recast the Existence Theorem |24.4.4 
in a format more similar to the Existence Theorem in [24.2.3 


Theorem 24.6.3 (Existence theorem): tnm:giobal-ct-iacies For every subgroup N C Cx of finite 
index, there exists a unique abelian extension L/K such that Nmp/q« Cz = N. 


Combining the two theorems, we can recast the bijective correspondence in Theorem|24.4.7 
in a format more similar to local class field theory |24.2.5 


Theorem 24.6.4: snm:gett-bijection The map L ++ Nmy/« (Cz) is an inclusion-reversing bijection 
between the set of finite abelian extensions of K and the open subgroups of finite index in 
Cx, that switches intersections and products: 


LOM => Nmz/x(Cz) 2 Nmyx(Cwm) 
Nmy,15/K(Cr,1.) = Nmz,/x(Cz,) A Nmz,/x(Cz,) 
Nmypint2/K(Crynt2) = Nmz,/«(Cz,) : Nmz,/x(Cz,). 
Similar to Theorem [24.2.4] we have the following topological isomorphism for global class 
field theory. 


Theorem 24.6.5 (Topological isomorphism for GCFT): timsctt-topoicey Let A be a number 


field. Let 
(KX) := J] Rsox JJ Cx JI 1. 


v real v complex VE ve 


The Artin map ¢x is surjective and induces a topological isomorphism 


K/K*(K%)° = G(K°?/K). 


6.1 Connecting the two formulations 


sec:connecting-formulations We now show that the two formulations of global class field theory are 
equivalent, in the following sense. 
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Theorem 24.6.6: tnm:gett-equivaiens We have the following implications. 


1. (Global reciprocity, ideles => ideals) If Theorem|24.6.1{1) holds for a given L/K, then 
Theorem |24.4.1(1) holds for L/K. If Theorem ]24.6.1)holds for all L/ over a specified 


basefield (e.g. Q), then Theorem |24.4.1| holds for all such L/K. 


2. (Global reciprocity, ideals => (ideles)—e) If Theorem 1)-(2) holds for a fixed K 
and a family {L/k} such that the compositum of the L” contains K'"" for every finite 
place v, then Theorem 1)-(2) holds for the same K and {L/K}, except that the 
resulting map $,/~K may not be compatible with }, when v is archimedean. 


3. (Global existence) Given Theorem |24.6.1} Theorems |24.4.4] and |24.6.3] are equivalent. 
4. (Bijective correspondence) Given Theorem |24.6.1| Theorems |24.4.7| and |24.6.4] are 


equivalent. 


Proof. For parts 1 and 2, we note that by Proposition |24.5.9| 


eq:gett-equivalentC x / NmypK C, = K/K* Nmyz/x L = I?-/ Px (1,m) Nmz/x (17), (24.10) 


where by Remark|24.5.11} we can choose m to some modulus containing only ramified primes, 
and S = S(m). Thus any one of the dotted isomophisms below gives the other isomorphism. 


gcft-equivalent2 K/K* Nmz/x( L) (24.11) 
ee _ 
p|= G(L/K) 
ot 


For part 1, given ¢z/%, we define ~z/x with the above diagram. Then, supposing p corre- 
sponds to the uniformizer 7, € Ky, 


bry (P) = vex (Pli(tr))) = Ox/x UM) = Gu(te) = Frobse x, ((T)) = Frobrx(p), 


”% 


as needed. Part 2 is a more complicated; we'll give the proof below after a lemma. The “—e 
comes from the fact that the formulation in Theorem [24.6.1]says nothing about archimedean 
primes. 

Parts 3 and 4 now result directly from the fact that gives a bijective correspon- 
dence between subgroups of two groups. 


Lemma 24.6.7: tem:ocal-uniqueness Suppose that K is a nonarchimedean local field, A is the 
maximal abelian unramified extension of K, and L is an abelian extension containing Kk. 
Let f : K* + G(L/K) be a homomorphism satisfying (1) and either (2) or (2)’: 
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1. The composition K~ a G(L/K) + G(K™/K) takes a to Frobgur/x (7). 
2. For any uniformizer 7 € K, f()|x, = 1, where 


Ine. 


2’. For any finite subextension K'/K of K,, we have 
f(Nmx/x(K))|Kr = {1}. 


Then f equals the reciprocity map ox. 
For the proof, see Section |27]8.1 


Proof of Theorem |24.6.6, Part 2. Given wWzr/K we define @p/K using (24.11). The Wr/K are 
compatible by Remark (24.4.2), so the @,/« are compatible (details omitted) and we can 
define dx = lim | @ryk Where the limit is over L/K in the given family. Let L’ be the 
compositum of the fields L. 


We check the hypotheses 1 and 2’ of Lemma [24.6.7] Let 
fu = OK Oty : KY > G(L"/K,). 
Item 1 is clear as gives letting v = vp, we have 
Ox (tv(@)) Lge = Ve (p"™) lyr = Frobiyr/x, (a). 


Item 2’ follows from part 3 of Theorem |24.4.1) applied to K’/K (see Remark |24.4.2): we 
get wy/K(Nmqr/«(Lz/))|K” = 1 which translates into $x (i,(Nmx7/«,(K)")))|K, = 1. Thus 
fv = oy for all finite places, as needed. 


We have proved the ideal version of global class field theory for cyclotomic extensions of Q. 
Our plan of attack will be to show transfer this to the idele version for cyclotomic extension 
of Q, then work on proving the idele version. Then we will be done by Theorem |24.6.6 


7 Kronecker-Weber Theorem 


ww As a first application of class field theory, we explicitly describe the maximal abelian 
extensions of Q, and Q. 


Theorem 24.7.1 (Local Kronecker-Weber theorem): tw: Every abelian extension of Q, is 
included in a cyclotomic extension, i.e. an extension Q,(¢,), CG, a primitive nth root of unity, 
for some n. In other words, 


Q? —= Qp(Gn | ne N). 
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Theorem 24.7.2 (Kronecker-Weber theorem): iw: Every abelian extension of Q is included 
in a cyclotomic extension Q(¢,,). In other words, 

Q” = Q(G, |n EN). 
Proof of Theorem|24.7. 1, Consider Q,(¢,) where p { k. Let U denote the group of units. As 
Q,(¢.) is unramified, local class field theory tells us 


Nimg, (¢¢)/Qp(Qp(Ge)*) & a!CrSe)Orl277 
Consider Q,(¢,m), which is totally ramified of degree p”~'(p — 1) over Q,. Local reciprocity 
gives 
Q/ Nm@Q,(¢5m)/Qp (Qn(Gpm)*) > G(Qa(Gm)/Q)- 


Thus both sides have the same order, p™~!(p — 1), and we must have 


Nmg,(6m)/Qp(Qp(Gpm)*) = U™ := p*(1 + (p™)). 


Suppose L/Q,, is an abelian extension. Its corresponding norm group N is open of finite 
index in Q,,, so contains 


p™(1 + (p™)) 
for some n,m. Choosing k large enough we may suppose n | [Q,(¢x) : Q,]. Then using 
Theorem |24.2.5/, 
N D Nm(Qp(Gn)*) OF Nm(Qp(Gpm)*) = Nm (Qp(Gnpm)*). 
By Theorem |24.2.5| we get that Q,(Grpm) D L. 


Proof of Theorem|24.7.4 Given an abelian extension K’/Q, choose a modulus m so that the 
Artin map is defined. Every modulus for Q divides oo(n) for some integer n. The ray class 
field of co(n) is Q(¢,). If m divides co(n), then K is contained in Q(¢,). Hence the maximal 
abelian extension is the union of all the Q(G,). 


We can similarly ask how to characterize abelian extensions of other number fields Kk. 
This is Hilbert’s Twelfth Problem and Kronecker’s Jugendtraum. Note that another way to 
phrase this theorem is the following: 


1. Q* is generated by the torsion points of Q* under multiplication. 
2. Let f(z) = e?. Then Q® is generated by f(Q): 
Q*” = Q(F(Q)). 


We can ask: for given K, can we get K?> by adjoining torsion points of some algebraic 
variety, and does there exist a nice function g(z) parameterizing this variety, so that 

K* = K(g(K))? 
It turns out that the answer is affirmative for quadratic extensions: roughly speaking, the 


maximal abelian extension is generated by torsion points of elliptic curves with complex 
multiplication. We will give a complete solution to this problem in Chapter [41] 


‘omitting the subscripts on norms to avoid clutter 
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8 Problems 


1.1 Why can’t we define Frob, € G(L/K’) when p is a prime in Kk that is ramified in L? 
12 Pixn-e N, 


(a) For which primes p € Z does (p) split completely in Z[¢, + ¢7']? (Be careful with 
p=2.) 
(b) Show that the ray class field of (n) is Q(¢, + G71). 


1.3 (IberoAmerican Olympiad for University Students, 2010/6) Prove that, for all integers 
a > 1, the prime divisors of 5a* — 5a? + 1 have the form 20k + 1. 


1.4 Consider the field extension Q( Wd, C3)/Q where d € Z is not a perfect cube. Let p be 
a prime relatively prime to 3d. Prove that a prime p splits into n factors in Q(Wd, ¢3), 


where 
(>, p=1 (mod 3) and d is a cube modulo p 
n= 1 3, p=1 (mod 8) and d is not a cube modulo p 
(6, p=2 (mod 3). 


2.1 Recall that G(i/K) has profinite (Krull) topology. Topologically W(K/K) is a Z- 
disjoint union of G(K/K)o-cosets G(K/K)oon, where on is any lift of Frobj, n € 
Z, where each G(K/K)oon is given the same topology as the profinite topology on 
G(K/K)o via translation by on. 


(a) Show that the natural inclusion .: W(K/K) > G(K/K) is continuous and has 
dense image. 


(b) Show that v is not a topological isomorphism onto u(W(K/K)), where the latter 
is equipped with the topology induced by that of G(K/K). 


4.1 Characterize all quadratic extensions A’/Q that are contained in a Z/4-extension. (Ben 
Blum-Smith, from |nttp://math.stackexchange.com/questions/596195/ 
conceptual reason-why-a-quadratic-field-has-—1-—as-—a-norm-if-and-only-if. 
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Chapter 25 


Group homology and cohomology 


group-hom-cohom In this chapter we introduce the theory of group homology and cohomology. 
In the next chapter we’ll specialize to the case of Galois groups, and then we’ll use Galois 
cohomology to prove the theorems of class field theory. Some results in this chapter will be 
given without proof; for detailed proofs see Rotman [Rot09]. We assume knowledge of some 
basic terminology and facts from category theory and commutative algebra (covariant and 
contravariant functors, natural transformations, left and right exactness). 

The idea of homology and cohomology—used in many different areas of mathematics—is 
that after applying a functor, a short exact sequence of modules may no longer be exact. 
Instead, we get the long exact sequence in (co)homology, with the (co)homology groups 
measuring the deviation from exactness. 

Exactly what functors are we applying? In group cohomology (Section (6), we apply 
Homg(Z, e), turning a short exact sequence of G-modules 


0O- AO BOC 30 


into 
favtntae() —> AS —> BG —> Ce —> HG, A) a ae (26.1) 


where A® is the submodule of A fixed by G. In the next chapter we will take A, B,C to bea 
multiplicative or additive subgroup of a field L, and G= G(L/K). Then A® is just AN K. 
Thus we see that the sequence gives information about the relationship between a 
field K and an extension field. For example, in Kummer Theory we take C = L*"; 
then C¢ = L*" 1 K, and we can characterize G(L/K) and hence L/K in terms of the nth 
powers of LZ appearing in K. This is representative a general trend in class field theory: 
characterize extensions of K in terms of information intrinsic to K. 

We also get a sequence in group homology (Section Is), and we can splice the sequences 
for homology and cohomology together to get the Tate groups (Section|9). Norm groups will 
make their appearance here—which is how, in class field theory, we get a correspondence 
between norm groups and field extensions. 

Finally, we assemble a toolbox of other constructions from group cohomology and ho- 
mology, including cup products (Section {10}, changes of group (Section [11), the corestriction 
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map (Section |11.5), results on cyclic groups and the Herbrand quotient (Section [12), and 
Tate’s theorem (Section [13). We include generalizations of cohomology to profinite groups 
(Section [14) and nonabelian groups (Section [15). 


1 Projectives and injectives 


Let A be an abelian category.! The reader unfamiliar with category theory may assume that 
A is the class of R-modules, since we will be primarily working with modules throughout. 


Definition 25.1.1: Let A be an abelian category. 


1. An object P € A is projective if for every surjection p: M -» N and morphism 
f:P—-—N, there exists a unique morphism g: P + M such that f = pog: 


9 ri 
Ze |) 
4 
M =a N 
Equivalently, Hom(P, e) is exact (or equivalently, right exact as it is always left exact).? 


2. An object I € A is injective if for every injection i: M << N and morphism f : M > 
I, there exists a unique morphism g : N > J, such that f = g 01: 


sth 
“wv 


I 
Equivalently, Hom(e, /) is exact (or equivalently, just right exact). 
Example 25.1.2: A free R-module (a direct sum of copies of R) is projective. 
Definition 25.1.3: An abelian category A... 


1. has enough injectives if for every object A € A there exists an injective object EF 
with a monic (injective) morphism A @ E. 


2. has enough projectives if for every object A € A there exists a projective object P 
with an epic (surjective) morphism P —» A. 


‘A category is an abelian category if it is an additive category such that every morphism has a kernel 
and cokernel, every monomorphism (injection) is a kernel, and every epimorphism (surjection) is a cokernel. 


?The diagram is equivalent to saying that if p : M -» WN is surjective, then so is the map 


Hom(P, M) “2"°), Hom(P, N), i.e. Hom(P,e) is right exact. 
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Definition 25.1.4: A projective resolution of A is an exact sequence 
Poi Spo ASR SASSO 
where each P, is projective. 
An injective resolution of A is an exact sequence 
E:03A3 EP 214, pig... 
where each E” is injective. 
Proposition 25.1.5: If A is an abelian category with enough projectives (injectives), then 


every object has a projective (injective) resolution. In particular, every R-module has a 
projective (injective) resolution. 


Proof. Build the resolution step-by-step. See Rotman |Rot09}, Proposition 6.2-5. For the 
second part, note that the category of R-modules has enough projectives and enough injec- 
tives. 


2 Complexes 


Definition 25.2.1: A complex in an abelian category (for example, the category of R- 
modules or abelian groups) is a sequence of morphisms 


Cie ee, Be 
such that the composition of any two adjacent morphisms is 0: 
dndn+1 = 0. 


We often work with complexes only going off to the left or right (positive and negative 
complexes, respectively), and label them 


SO PO, poe Oy 0 
($C S309 22 Osu 
We will want to work with complexes like they are single objects. 


Theorem 25.2.2: The class of complexes in A can be made into an abelian category, 
Comp(A) as follows: The objects are the complexes and the morphisms are chain maps 
f =(fn): C > C’, i.e. a sequence of maps making the following commute. 
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Proof. See Rotman [Rot09], Proposition 5.100. 


We will be interested in cohomology and homology modules associated to chain com- 
plexes. For this, we have the following notion of what it means for chain maps to be “the 


same” (See Theorem |25.3.2). 


Definition 25.2.3: Two chain maps f,g : C + C’ are homotopic if there exist a family 
of morphisms s,, : C;, — C/,,, such that 


te — Gn = Gin ar By igs 


In Section [4] we will define the homology modules and cohomology modules from projec- 
tive and injective resolutions. To show this does not depend on the choice of projective or 
injective resolution, we need the following theorem. 


Theorem 25.2.4 (Comparison Theorem): thm:comparison Let A be an abelian category, and 
suppose we have two complexes C :--- > P; 3 Py 7 A> OandC’:---Pi/ > RA’ 0 
and a map g: A — A’. Then there exists a chain map f extending g: 


>P; > Po »A +0) 


|p | 9 


P{ —+ P! —+ A! —0. 


Moreover, f is unique up to homotopy. 
The same is true of complexes going off to the right (reverse the arrows above). 


Proof. Rotman [Rot09], Theorem 6.16. 


3 Homology and cohomology 


Definition 25.3.1: Given a complex C, define 

Zn(C) = ker(d,) 

B,(C) = im(dn41) 

H,{C) =Z,(C)/ BAC). 
H,, is called the nth homology module. For upper indexing, we let Z7"(C) = ker(d"), 
B"(C) = im(d""1), and H"(C) = Z"(C)/B"(C), and call H” the nth cohomology mod- 


ule. 
Think of H,, as measuring how far the complex is from being exact at Cy. 


Theorem 25.3.2: tnmtn-functor Let A be an abelian category. For every integer n, H,, is an 
additive functor from Comp(A) — A. Moreover, homotopic chain maps induce the same 
map in homology. 
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Proof. See Rotman |Rot09|, Proposition 6.8. 


Theorem 25.3.3 (Long exact sequence): ics A short exact sequence of chain complexes 


0 oC! >0 


induces a long exact sequence of homology modules 


On 


Ho, "5 A) 5 ag 
The map 0, is defined by 
Onleh] = lentsdn—1Pa Cn) € Ch_t.- 
Proof. Let H, = H,(C), B, = im(d"*!), and Z,, = ker(d”) for the complex C, and define 
Hj, H', and so forth similarly. By the Snake Lemma, the gray sequence below is exact. 


Hi, > H, > H" 


Vv 4 4 De Vv 
C! /B!, —> C,/Bn —> C”/B" ——> 0 


/ in 
dy-1 Catt dn-1 


Note that the connecting homomorphism is exactly that in the Snake Lemma. 


4 Derived functors 


sec:derived-functors 


4.1 Right derived functors and Ext 


4.1.1 Covariant case 


Given an injective resolution of B, 
B m mo d. mi dt a? 
EY: CA BoE OE Ook :::, 


AEE 
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applying a (covariant) functor T gives (after deleting TB) 


ee ST (25.2) 
We will primarily be concerned with the case where T = Hompr(A, e), so the above becomes 


Hom( 


[0) om 1 
— ) Hom(A, E") ae, ea) 


Hom(A,d?) 
poate sisson AC 


eq:THomO —> Hom(A, E’) Hom(A, Ey oar (25.3) 


Definition 25.4.1: Let T be a covariant functor. The nth (covariant) right derived 


functor of T is 
ker(Td") 


2 =f (Te?) = im(Poo) 


i.e. it is the nth cohomology module of (25.2). 
For a R-module E, define 


Ext’,(A, B) := (R” Hom,(A, e))B = H"(Hom,(A, E®)), 
ie. it is the nth cohomology module of (25.3). 


Here d7! is the trivial map 0 + E°. We need to show that this definition does not depend 
on the injective resolution chosen. 


Proof of well-definedness. Suppose we have two injective resolutions of B: 


1 


0 »B— > B® A | SO a 
| fo fi 
(69-557 gs 


Let (R"T)B = SOO?) and (RT) B = Say. 
By the Comparison Theorem there is a unique chain map f between the two 
resolutions, up to homotopy (the dotted lines above). Apply T to this diagram to get a 
chain map Tf, : TH" > TE". As H,, is a functor by Theorem [25.3.2 Tf induces a map on 
the cohomology modules (R"T')B > (R’"T)B. Since we can construct a chain map g from 
the second to the first resolution as well, (R"T)B — (R'"T)B must be an isomorphism. 
For the details, see [Rot09], Proposition 6.20. (The argument there is written for left 
derived functors, but the idea is the same.) 


4.1.2 Contravariant case 


We can define a companion functor ext?% that is contravariant instead of covariant. Given 
an projective resolution of A 


Pe a Sp SPP SAS 0, 
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applying a contravariant functor T' gives 


ee = TP, pp PP Fe, (25.4) 


To define ext, let T = Hompa(e, B). 


Definition 25.4.2: Let T be a contravariant functor. The nth (contravariant) right 
derived functor of T is 


(PT)As= HPs) = SE 
i.e. it is the nth cohomology module of (25.4). 
For R-modules A, B, define 
ext?(A, B) := (R" Homp(e, B))A = H"(Hompe(Pa, B)) 
Theorem 25.4.3: nxtisexi For R-modules A, B, 
Ext? (A, B) = ext (A, B). 


This theorem says that we have two choices when we need to calculate Ext(A, B), 
namely, 


1. Find a injective resolution of B and apply Hom(A,e) (the Ext perspective), or 


2. Find a projective (e.g. free) resolution of A and apply Hom(e, B) (the ext perspective). 
Proof. See Rotman |Rot09|, Theorem 6.67. 


4.2 Left derived functors and Tor 
Next we define left derived functors and Tor analogously. Given a projective resolution of A 
Ps et BoP SP SP SASSO 
applying a covariant functor T’ gives 
st Pe TP eT 
To define Tor, let T =e ®p B. 
Definition 25.4.4: The nth left derived functor of T' is 
(Lal) = AAT Ps) = at 
For A an R-module, define 
Tor? (A, B) := (Ln(e @z B))A = H"(Pa @p B) 
tor? (A, B) := (Ln(A @z 0))A = H"(A @p Pa). 


(Note Tor®(A, B) = tor®(B, A).) 
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Note unlike the case with Ext, we need only consider covariant derived functors: Homp 
is contravariant in the first entry and covariant in the second, while ®, is covariant in both 
entries. Similar to Theorem |25.4.3} we have the following. 


Theorem 25.4.5: toristor For A, B R-modules, 
Tor®(A, B) = tor™(A, B). 


Proof. See Rotman |[Rot09|, Theorem 6.32. 


4.3. Long exact sequences 


The most important property of the derived functors is that they repair “loss of exactness” 
after applying the functor. 


Theorem 25.4.6 (Long exact sequence): tcs-ex: Let 0 + A— B—C > 0 be a short exact 
sequence of G-modules. 


1. Let T be a left exact covariant functor. Then there is a long exact sequence 


0-027) A nese A 
TB TC 
2. Let T be a right exact covariant functor. Then there is a long exact sequence 


—> (Ly, Pos, [oT ) A —> (LoT') B —> (Lo T)C — 0 


The maps O” are given by the snake lemma. 


Proof. The long exact sequences exist by Theorem |25.3.3} (Note that the complexes only go 
off to the right /left in the two cases, respectively.) It remains to show the equalities. Take 
a projective resolution of A, 


Pe Pe Pe Ae 0, 


By right exactness of 7’, the following is exact: 


Td, 


Te PP =6 7A 0. 


Hence (Lof)A = TPo/im(TP,) = TA. 
The second part is similar. 


280 


Number Theory, §25.5 


Corollary 25.4.7: tcs-cxt-tor We have the long exact sequences 


0 —> Ext9,(M, A) ——> Ext®,(M, B) —— Ext®(M,C) 24 Ext!,(M, A) —> 


Homp(M, A) Homr(M, B) Homr(M, C) 
and 


= Torl (Cc, M) + Tor®( (A, M) —> Tor4’(B, M) ——> Tord'(C, M) —> 0 


M @rA MSrB M @rRC 


Proof. Homp(A, e) is left exact and e ®p B is right exact. 


Example 25.4.8: ex:ext-inj We have the following. 


B injective => Ext}(A, B) =0 forall A,n>1 

A projective = > Tor}(A, B) =0 for all B, n > 1. 
Indeed, recall that Ext is defined by taking an injective resolution of B and Tor is defined 
by taking a projective resolution of A, and in these cases we can take the trivial resolutions 


0O- BS BoO0and0- AS ASO. 


Example 25.4.9: exabetian-tor2=0 Take R = Z. Then a R-module is just an abelian group. 
Every group H has a free resolution of length 2: 


0O- Fk aw ham ASO. 


Thus ext?}(H,G) = 0 and Tor2(H,G) = 0 for n > 2. 


5 Homological and cohomological functors 


sec:cohom-functor This section is more abstract and may be skipped. 
As we saw in Corollary |25.4.7/and Example|25.4.8| the key properties of Ext’, are roughly 
the following: 


1. Ext®(A, B) = 0 when B is injective and n > 1. 
2. Short exact sequences give rise to long exact sequences. 
3. In dimension 0, Ext?,(A, B) = Hom (A, B). 
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We have a similar description for Tor”. 
We abstract the definition for Ext and Tor, by defining homological and cohomological 
functors. There are several reasons for doing this: 


1. We want to talk about natural transformations between cohomological functors. 


2. In the last section we showed the existence of Ext satisfying the above properties (and 
similarly for Tor). It turns out that these properties characterize it uniquely. Thus we 
can just “remember” these properties and forget the details of the construction. 


There are similarly other (co)homological functors, and we sometimes want to show 
they are equal. To do this, it turns out we can just construct an isomorphism in 
dimension 0, and the rest works out by abstract nonsense. (See Theorem |25.5.2!) 


Note in the above characterization of Ext we said Ext/(A,B) = 0 for n > 1 when B is 
injective. This is useful because every R-module has an injective resolution. In general, 
though, we may want to work with a general class of objects, say x (which in our case is the 
class of injective modules). The key property is that for every module A there is an injective 
module £ and an injective morphism A —> E, i.e. the category of R-modules has enough 
injectives. 


Definition 25.5.1: Let (7": A — B)n>0 be a set of additive functors on abelian categories, 
and let y be a class of objects in A. We say A has enough y-objects is every object in A 
can be embedded in an object in y. 

Supposing A has enough y-objects, (T”)n>o0 is a cohomological 0-functor if the fol- 
lowing hold. 


1. (IT")n>0 is x-coeffaceable: T"(X) = 0 for all X € x and n> 1. 
2. For every short exact sequence 0 > A > B + C -— 0 there is a long exact sequence 
0+ T°(A) +--+ 4 T"(A) 3 T(B) 4 TC) TA) 


such that the diagonal morphisms 0” are natural (with respect to maps between two 
short exact sequences). 


A morphism of cohomological 0-functors is a natural transformation 7” : T” — H” com- 
muting with the diagonal maps 0”. 


There is a similar definition for effaceability and homological 0-functors. We can also 
consider (T”),cz, that is 0-functors extending infinitely in both directions, replacing the long 
exact sequence with an infinite exact sequence extending in both directions. 

The following theorem gives existence and uniqueness of (co)homological 0-functors. 


Theorem 2 5 . 5 . 2 : thm:hom-functor-uniqueness 
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1. Suppose 7° : T° - T’° is a natural transformation of cohomological 0-functors in 
degree 0. Then there exists a unique morphism of cohomological 0-functors 7 : T — T" 
extending 7°. 


2. Suppose 7”",7’" : A — B are two cohomological functors, and there is a natural 
isomorphism T° Y T’”’. Then T” & T”. 


The same is true of homological 0-functors, and 0-functors extending in both directions. 


Proof. See Rotman [Rot09], 6.35. 


For example, Ext is characterized completely by the 3 properties we gave: it is a coho- 
mological 0-functor by items 1 and 2, and uniqueness comes from knowing it in dimension 
0 (item 3). Ditto for Tore. 


6 Group cohomology 


group-cohomology TO apply homology to groups, we will turn a group G into a ring, and consider 
modules over that ring. 


Definition 25.6.1: Let R be a ring. The group ring R[G] or RG is the ring 


with multiplication given by 


(= os) (= is) = ST agbagh. 


geéG heG g,heG 


We will always work with R = Z. 
Note that any action of G on a Z-module makes the module into a ZG-module. We often 
just abbreviate “ZG-module” as “G-module.” 


Definition 25.6.2: Let G be a group and A, B be left ZG-modules. 


1. The diagonal action of G on Homz(A, B) is given by 
(99)(a) = 9(y(g*a)). 
2. The diagonal action of G on A ®zq B is given by 
g(a @ b) = (ga) @ (gb). 
We now apply cohomology as follows. 
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Definition 25.6.3: Let M be a G-module. Equip Z with the trivial G-module structure. 
The cohomology groups of G with coefficients in M are defined by 


H"(G, M) = Ext%,(Z, M) = H"(Homzq(Z, E™)) 
= ext7¢(Z, M) = H”(Homze(Fz, M)). 


Note from Theorem |25.4.3| we have two choices in finding H"(G, M): find a ZG-injective 
resolution of /, or a ZG-projective resolution of Z. 
There is a nice interpretation of H°(G, M). 


Definition 25.6.4: Let L,M be G-modules and y be a map L + M. Define the fixed 
point functor by the following. 


1. Action on modules: 
M° ={meM:gm=m for allg eG}. 
2. Action on maps: Since y(L°) C M® we can define 
gp = Yylre. 
Proposition 25.6.5: conom As functors, 
H°(G,e) = Homzg(Z, e) = e%. 
In particular, the fixed point functor is left exact since Homzq(Z, e) is. 


Proof. Z is equipped with the trivial G-action. A G-homomorphism y from Z to M is 
determined by y(1), and y(1) must be a fixed point. Hence Homzg(Z, M) = M® via the 
map y ++ y(1). 


Remark 25.6.6: This gives us another way to think about group cohomology. Given M, 
take an injective resolution 0 — M > E°® > E' > .--. Applying Homzq(Z,e) to this 
resolution is the same as applying e°, so we get 0 + (E°)% > (E')° +---. Then H"(G, M) 
is the nth cohomology group of this complex. 


We will need the fact that cohomology preserves products. 


Proposition 25.6.7: cohom-preserve-prod Let G be a group and M; be G-modules. Then 
i («. neg = T] #"(G, Mi). 
tel wel 


Proof. First note that the product of injective modules is an injective module: By definition 
a R-module I is injective iff Homp(e, J) is exact. Thus, the statement follows from the fact 
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that Home (e, [I]; /;) = I]; Homa(e, /;), and the fact that a product of exact sequences is 
exact. 

Thus if £™” is an injective resolution for M;, then []; £™ is an injective resolution for 
II, “i, and we get 


H” («. Il i) = H" (Homgg (Z, Eller) = H" (Homa (z. II B™)) =] #"(G, M4). 


ier wel wel 


7 Bar resolutions 


sec:bar-res We now describe the cohomology groups, by working with an explicit presentation 
of Z. (We use the ext approach.) This will give practical interpretations of H'(G,M) and 
H?(G, M). For proofs, see Rotman [Rot09], Section 9.3. 


Definition 25.7.1: Define the bar resolution B(G) to be the exact sequence 


2S Boss Be eR 7 30 
where 
Bn & ZG° 
is the free abelian group with basis elements denoted by |x|---|v,], and 
eabar-adn([21| +++ [@n]) = €1[22| +--+ [en] + xe )ea|=s* | Baga |> es en) 1)" a|+* |e. 
YH 


(25.5) 
Let U, C B, be the submodule generated by [2|---|z,] where at least one of the 2; 
equals 1, and define the normalized bar resolution to be the quotient complex B*(G) := 


B(G)/U(Q). 


Note in particular 


ds[z|y|z] = x[y|z] — [xylz] + [zlyz] — [zy] 
do|z|y] = x[y] — [xy] + [2] 
d(x] = 2[] — [] 
do|| = 1. 


We have Homg(B,, M) = Home (ZG @" , M), so it can be identified with the set of functions 
G”" + M. Working out the kernels and images, we get the following. 


Theorem 25.7.2: expiicit-n1 We have the following descriptions of H'(G,M) and H?(G, M). 
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1. Define a derivation (or crossed homomorphism) of G to be a function G + M such 
that 
d(xy) = d(x) + ad(y) 


and a principal derivation to be one in the form 
d(x) =a— <a, for some a € M. 


Denote the set of derivations and principal derivations by Der(G, /) and PDer(G, M). 


Then 
H*(G, M) & Der(G, M)/ PDer(G, M). 
2. We have 
BGM) = LLiGX GM: fav) + fen) = 2fly.2) + flee), (el) = fy) = 0) 


{9g:GxG—oM:g(z,y) = ch(y) — h(zy) + h(x) for some h: G > M} 
The elements in the top set are called factor sets. 
A particularly important case is the following. 
Corollary 25.7.3: i1ishom Suppose G acts trivially on M. Then 
H'(G,M) & Homz(G, M). 
(On the RHS, G and M™ are thought of as groups.) 


Proof. Because the action is trivial, a derivation is just a function with d(xy) = d(x) +d(y), 
i.e. a homomorphism. Moreover, any principal derivation is trivial. 


8 Group homology 
group-homology 


Definition 25.8.1: Let A be a G-module. Equip Z with the trivial G-module structure. 
The homology groups of G with coefficients in Z are defined by 
H,,(G, A) = Tor’°(Z, A) = Hy (Pz @z2c A) 
= tor" (Z,, A) => H,,(Z ®za Py) 


There is similarly a nice interpretation of Hp(G,M), as well as of Hi(G,Z). Given a 
group G, define the map ¢ : ZG > Z by « (ee Qg9) = VyeG 4%, and define 
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Proposition 25.8.2: 1om As functors, 
Ay(G, e) = e/Ice; 
i.e. there is a natural isomorphism 


Hy)(G,A)=Z@gA- A/IGA 
mearymatIgA. 


Proof. The short exact sequence 0 + Ig + ZG —> Z gives exactness of 
Ig @®g@ A> ZG ®g@A>Z&®gA-> 0 
since tensoring is right exact. (G acts trivially on the Z on the right.) Thus, 


Hy(G, A) =Z @g A= (ZG ®g A)/(I¢ @g A) = A/IGA. 


Proposition 25.8.3: i:-isza» There are canonical homomorphisms H)(G,Z) ~ Ig¢/I2 = G?”. 


Here G?> denotes the abelianization of G, i.e. G/G’, where G’ is the derived subgroup, 
the (normal) subgroup generated by the commutators aba~'b7!. 


Proof. The long exact sequence in homology for 0 > Ig + ZG > Z — 0 is 


Ig/ I, 


The left term is 0 by Example [25.4.8]since ZG is free, hence projective. Thus 0 is injective. 
From Proposition 25.8.2} we get the middle two inequalities (since Hp(G, ZG) = ZG'/IgZG = 
Z). Surjectivity of the map Z — Z gives that it is actually an isomorphism, so exactness 
gives 0; is an isomorphism. It remains to show 


Define a map f : G > Ig/I2, by letting f(x) = (x — 1) mod J%. This is a homomorphism 
because 


f(y) = zy —1 mod I3 
= (x —1)+(y—1) mod IZ (cx -1(y-lhe 2 
= f(x) f(y). 
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Now G’ € ker f since Ig/J2, is abelian (ZG, as an additive group, is abelian), so we get a 
map f : G/G’ > Ig¢/I2. 

Now define g : Ig > G/G’ by g(x — 1) = xG’. (Note x — 1,2 € G\{1}, is a free basis for 
G.) We have 


g ( >». men, (ey = 1) -@=-1)-@- 09] 
xz,yeG\{1} 
= Il (xyz ty *)™"G" —_ c’ 
xz,yeG\{1} 


of ~~ m(z-1) my) 


xeEG\{1} yeG\{1} 


so g induces g : Ig/IZ, > G/C’. 
Now f and g are inverse, showing (25.6). 


8.1 Shapiro’s lemma 


shapiro Shapiro’s lemma will be helpful in computing (co)homology groups, especially in the 


guise of Corollary |25.8.8 
Definition 25.8.4: Let S C G be a subgroup of finite index. Define the induced and 
coinduced modules to be? 
Indg(A) = A @zs ZG. 
Coindg (A) = Homzs(ZG, A). 


If S = {1} we simply write Ind°(A) or Coind®(A). An induced module of G is a module 
in the form Ind°(A); a coinduced module of G is a module in the form Coind® (A). 


Remark 25.8.5: rem:sinite-inducea If G is finite, the induced and coinduced modules are canoni- 
cally isomorphic via the below map, so there is no need to distinguish between them. 
Homs(ZG, A) > A @zs ZG 


pr > v(g7") zs 9. 
gEG/S 


Proposition 25.8.6: prcoinducea-subgroup If M is a coinduced G-module, and H C G is a sub- 
group, then M is a coinduced H-module. 


Proof. Write M = Homz(Z[G], A); we can write Z[G] = Z[H]| ® B; then we have by adjoint 
associativity? that M = Hom(Z[H] @ M, A) = Hom(Z[H], Hom(M, A)). 


3Be careful; in some books the definitions are reversed. We follow Serre’s definition, which is the opposite 
of Milne’s definitions. 

‘If R,R’ are rings, M is a R-module, N is a (R, R’)-bimodule, and P is a R’-module, then there is a 
canonical (R, R’)-isomorphism Homry(M, HompR (N, P)) = Home (M @R N, P). 
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The cohomology of coinduced modules and the homology of induced modules are easy to 
calculate. 


Lemma 25.8.7 (Shapiro’s lemma):  shapiro-temma The following hold. 
H"(G, Coindg(A)) = H"(S, A) 
H,(G, Ind@(A)) = H,,(S, A). 


Proof. Let Pz be a ZG-projective resolution of Z. Note it is also a ZS-projective resolution, 
as any ZG-projective module is ZS-projective. 
By definition of cohomology group, 


H"(G, Coind§(A)) = H"(Homzg(Pz, Homzs(ZG, A))) 
© 1"(Homzs(Pz ®z¢ ZG, A)) = H"(Homzs(Pz, A)) = H"(S, A). 


In (*) we used adjoint associativity. 
By the definition of homology group, 


H,(G, Ind¢(A)) => A, (Pz &2G (ZG CASI A)) = A,(Pz 79 A) = HAS, A). 


Corollary 25.8.8: snapiro-cor Suppose that A = @j<; Ai, S = Stab(A;) (defined as {g € G: gA; = A;}), 
and G permutes the submodules A; transitively. Then 


H,(G, A) = H,(S,A;). 


If G is finite, then 
H"(G, A) = H"(S, A;). 


Proof. We have A = Ind Aj. If G is finite then A & Coind§ A; as well. 


Corollary 25.8.9: corina-o If M is an coinduced G-module, then H"(G, M) = 0 for all n > 1. 
If M is an induced G-module, then H,(G, M) = 0 for all n > 1. 


Proof. By Shapiro’s lemma|25.8.7} 
M = Coind®(A) —s H"(G,M) = H"(1,M)=0 
M = Ind°(A) = H,(G,M) = H,(1, M) = 0 


We used the fact that Z is Z[{1}]-projective. 


9 Tate groups 
tate-groups By Corollary |25.4.7| given a short exact sequence of G-modules we get a long exact 


sequence in homology and cohomology. We splice these sequences together using the Snake 
Lemma to obtain a long exact sequence extending in both directions. 
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Definition 25.9.1: arenes Let G be a group, S be a subgroup of finite index, and A be a 
G-module. Define the norm N@/s : AS + A® by 


Nejs(a) = >> tya, 
j=l 


where {t1,...,t,} is a left transversal (i.e. coset representatives) of S in G. In particular, 
for S = {1} the norm map is 


Nala) = Na) = (= :) a. 


gecG 
Definition 25.9.2: taiea: Suppose G is a finite group and A is a G-module. Define the Tate 
groups by 
(HG,A),  q>1 
,AC/NA = 0 
| NA/IGA, q= —] 
{ H-a-1(G, A), qd = —2. 


Here yA denotes {a € A: Na =O}. 


Theorem 25.9.3: aoubicics If G is a finite group and 0 — A > B > C — 0 is an exact 
sequence of G-modules, then there is a long exact sequence 


biands HAIG, AY YEG, B) 2 HG.C) | HG A) ae 


Proof. It suffices to prove exactness for g = —1 and q = 0. We apply to the snake lemma to 
obtain the following (the top and bottom rows in the middle are the long exact sequence in 
homology and cohomology, respectively). 


ker N4 ———» ker Np ———> ker No 


Na Nz No 


v Vv Vv 0 
0 ———» H°(G, A) —> H°(G, B) —» H°(G,C) 2 H'(G, A) 


| il | ve 


+» coker(N4) —» coker(Ng) —> coker(Nc) 
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The maps Ny, Ng, Nc are the norm maps on A, B, and C after associating Hp and H® with 


their descriptions in Propositions |25.6.5} and |25.8.2 
nA/IGA —> nB/IcB —»> nC/IgC 


xO 


a A v v v 
AAG, OC) => AjigA => B/ eB ——=+-C/ 160 = _—+6 


Na Ne Ne 


| 


Vv 
» BS /NB ——» Cg/NC 


v 
== AS/NA 


9.1 Complete resolution* 


° The description of Tate groups in the last section is somewhat unwieldy (because you can 
see the glue marks...). We give a different interpretation here, where the Tate groups at 0 
and —1 are less distinguished. Then we use the technique of “dimension shifting” to extend 
results for cohomology (or homology) groups to results for Tate groups. 


Definition 25.9.4: A complete resolution of a group G is an exact sequence X 


do 


> Xy > Xo > X_} » X_»9 ——>->- 


Z 
where each X, is a finitely generated G-free module, € is surjective, and 177 is injective. 


Proposition 25.9.5: complete-resolution Every finite group G has a complete resolution X. 


Proof. Take a G-free resolution of Z and its dual (A* = Homz(A,Z)), and splice them 
together. 


>P; > Po » Z, +0) 


0 > ZS cD ait > Pr base 


>This section will not be used and can be omitted. 
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Proposition 25.9.6: Let G be a finite group, A a G-module, and X a complete resolution. 
Then the Tate groups are exactly the cohomology groups 


H?(G, A) = H"(Home(X, A)). 
Proof. Since any two resolutions are chain-homotopic (going both ways) by the Comparison 
Theorem |25.2.4| it suffices to prove this for one resolution. We take a resolution as in 


Proposition |25.9.5/and apply Homg(e, A) to it. We obtain the following. 


—2 =f 0 1 


--- > Homg(Pi, A) —— Homg(P}, A) —> Home (Po, A) ——> Home (P,, A) ——-:- 


fF 


—-> P,; 2a A — 4p 2G A _@, Homg(Ps, A) —_? . Home(P,, A) ————_ 


L@gGA as Hom (Z, A) 


A/IgA AG 


The isomorphisms on the left are given by the natural isomorphism 


M za A> Homg(M*, A) 
mear (fH f(mja). 
The bent complex along the bottom is the complex for Tate cohomology; some diagram 


chasing gives that these groups are isomorphic to the cohomology groups in the middle 
complex. 


9.2 Dimension shifting 


Given a result or construction in dimension n, we can get the result in dimensions n + 1 by 
utilizing the long exact sequence |25.9.3] and the two propositions. 


Proposition 25.9.7: inauceatateo Let G be a finite group. If M is an induced module then 
Ht(G,M) =0 
for all n. 
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Proof. Since G is finite, induced and coinduced modules are the same. The statement for 
homology and cohomology is Corollary [25.8.9 this takes care of alln 4 0,—-1. Forn =0,-1 
we calculate H2(G, M) directly. Writing M = A®z ZG, we see that every element of m can 
be uniquely written as 7 jcq@y @ g. We find that 


( ) 
Mm eo a aa! 
geG 
( ) 
nM = 29 OS Ds alae 
gEG gEG 


so H°(G, M) = H;'(G,M) =0. 


Proposition 25.9.8: ininducea Let M be a module. Then there exist (canonical) short exact 
sequences 


0- M- M* > M*/M 0 
0+ M'>M,>M-0 


such that M™* is coinduced and M,, is induced, and these sequences are split as abelian groups 
(i.e. as Z-modules, but not necessarily as ZG-modules). 


Proof. The desired maps are 


M <> Coindy;(M) 

mM —+ Pm(g) = gm. 
Z|G] @®z M+ M 
g@mregm 


Splitness follows from the fact that these maps have left and right inverses, respectively: 
p+ y(1) and m+> 1@m. (They are only Z-homomorphisms, not necessarily ZG- 
homomorphisms. ) 


Now suppose G is finite; then coinduced and induced modules coincide. Taking the long 
exact sequence |25.9.3] of the above short exact sequences and using Proposition |25.9.7| gives 


Hi(G, M) & HR-1(G, M*/M) 
HG, M) © H*(G, M’). 


Thus we reduce a problem about cohomology in degree n to a problem about cohomology 
in degree n + 1 or degree n — 1. 
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10 Cup products 


cup-products There is a natural product defined in Tate cohomology. 
Define A@ B to be A ®z B with the structure of a G-module given by g(a ®b) = ga® gb 
(the diagonal action). 


Theorem 25.10.1: tim:cup-proauct Let G be a finite group and A, B be G-modules. There exists 
a unique family of bilinear maps indexed by (p,q) € Z?, together called the cup product, 


U: H#(G, A) x H£(G,B) — HE(G,A@ B), 
satisfying the following four properties. 
1. The homomorphisms are functorial in A and B. 
2. For p= q = 0, the cup product is induced by the map 


A° @ BS = (A@B)*. 


0+ A’ > A> A" 50 
0OS- A @BA ABBA’ OEBA>0 


are exact®, and a” € Hi(G, A"), b © H#(G, B), then 
(da”) Ub = 6(a” Ub) 
in H?*4*1(G, A’ @ B). (6 is the map in the corresponding long exact sequence.) 
4. If 


(2. BS Ba 
0S A®B > ASB AB" 0 


are exact, and a € H#(G, A), b” € Hi(G, B”), then 
a U (6b") = (—1)?d(a Ub") 
in HPtt+1(G, A @ B’). 


Proof. We first define the cup product for cohomology groups and then use dimension shifting 
to define it for Tate groups. 


°Recall e @ B is right exact, so the content is in left exactness. 
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We use the bar resolution’, so that n-chains are functions G” > A. For p,q > 0, define 
U:C®(G, A) x C%(G, B) — C?*4(G, A @ B) 


by 
(f Ug)lai| ++ [ep+q] = F([eil ++ |Zpl) ® g([tp4il -- + |Zp4q])- 


For n = 0, we have (f U g)[] = f[] ® g[] which shows property 2 is satisfied. We® can 
laboriously verify with (25.5) that 


d(f Ug) = (df) Ug + (-1)?f U (dg). 
From this we get a well-defined map 
U: H?(G, A) x H9(G, B) — H°*4(G,A®@ B). 


We can verify properties 3 and 4 by calculation. 

Now we extend this definition by dimension shifting. Suppose the product is defined for 
(p +1,q), we define it for (p,q) as follows. Write A (canonically) as a quotient of a induced 
module as in Proposition [25.9.8] 0—- A’ A, ~ A- 0. Since this is split, so is 


039 A Q@BOA,Q2BOAQB-50. 


Since A, is induced, so is A, ® B (be slightly careful about the G-action here). Thus by 
Theorem |25.9.3| we get H?.(A) = HP*'(A’) and HP*4(A) = H?*9*1(A’@ B) (naturally), and 
thus we can define the cup product 


H?(A) x HF(B)- LY saatiek HP*(A @ B) 


iF iF 


Hr" (A’) x Hp(B) — H(A’ ® B) 


Similarly define it for (p,q) given (p,q+1), but this time introduce a factor of (—1)? (in order 
to make the second condition hold). Note this is consistent with our defintions for p,q > 0, 
by conditions 3 and 4. It is not hard to verify that these maps are well-defined, and that 
conditions 3 and 4 continue to be satisfied. By the way we defined the maps, it also doesn’t 
matter what order we define the maps in (so going from (p+ 1,q¢+ 1) > (p,¢+1) - (p,q) 
is the same as going from (p+ 1,q¢+1) > (p+1,q) > (p,q), for instance). 

Given the map for (p,q), conditions 3 and 4 basically force us to define the map for 
(p—1,q) and (p,q—1) as above. Similarly we can dimension-shift in the opposite direction, 
and we get uniqueness for all (p,q). 


7We can also use the standard resolution (not defined here); in that case the map is (fUg)(zo,---,%p+q) = 
f(0,---,;Lp) @ g(Lp,---,Lp+q)- 
Sie. you 
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Cup products are rather nasty to work with when they aren’t purely in cohomology, so 
if we need to do cup product computation, we work in cohomology whenever possible. 


Proposition 25.10.2: The following hold: 


1. Cup product is associative: For x € H™(G,M), y € H"(G,N), and z € H®(G, P), 
(cUy)Uz=axU(yU2z) (viewing the equation in H™*"*?(G, M @ N @ P). 


2. Cup product is anticommutative: For c € H™(G,M) and y € H"(G,N), rUy = 
(—1)"™"y Ua. 


Proof. Omitted. The idea is to verify the formula in degree 0 and then dimension-shift to 
get the general case. 


10.1 Cup product calculations 


To compute the Artin map in class field theory, we will need to calculate the cup product 
of things in dimensions —2 and 2. We will get there incrementally using dimension shifting 
and properties 3-4 of the cup product, first calculating the cup product on dimensions (0,7) 
(especially (0,1)), then on (—1,1), and then finally on (—2, 1). 


Theorem 25.10.3: tim:cup-proa-cac Let G be a finite group and A,B G-modules. If a € A®, 
let a@° denote its image in H9(G, A), and if Na = 0, let @ denote its image in Hp'(G, A). 
For g € G let g denote its image in G/G’ = H7°(G, Z). 
1. (0,n). Suppose n > 0, a € A®, and x € H2(G,B). Let f,: B + A®B be the map 
sending y to a @ y; it induces a map H2(G, A) > H2(G,A® B). Then 


a. x 
“Ya” “Ya” 
EHO(G,A) €H2(G,B) 


= fa(z) € HP(G,A® B). 


2. (—1,1). Suppose Na = 0, and [f] € H'(G, B) is represented by a cocycle f : G > B. 


Then 
0 


@  U ffl =(-Duese) 


“Ya 
a teG 
eH; (G,B) €H4(G,B) 


3. (—2,1). Let s € G and [f] € H'(G,B). Then 
oes 
€H7"(G,Z) €H1(G,B) 


Proof. We omit details of the calculations. See Serre [Ser79|, pg. 176-178. 


1. For n = 0, this follows from definition of cup product. Now use dimension shifting, 
with the exact sequence 0 > B > B* + B*/B — 0, B* coinduced. 
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2. Dimension shift from part 1 with 0 — B > B* + B*/B — 0: suppose b” € (B*/B)° 
is sent to f under the diagonal morphism. Write GU f = G U d(b””) = —d(G U b””) 
and use part 1. 


3. Show that 


d(s U[f]) = d(f(s)o). 
Evaluate the LHS using property 3 and part 2. 


11 Change of group 


change-of group We would like to be able to connect (co)homology groups corresponding to 
different groups G, G’ and different modules over G, G’. This will allow us, for example, to 
define maps 


Res” :H"(G, A) > H"(S, A) 
Cor, :H,(S, A) > Hn(G, A) 
Inf” :H"(G/S, A°) > H"(G, A) SAG. 


11.1 Construction of maps 


For there to be a map H"(G, A) > H"(G’, A’) we need there to be a map G’ > G, with 
some compatibility condition on the modules A, A’. 


Definition 25.11.1: Let G,G" be groups, let A be a G-module and A’ be a G’-module. 
A cocompatible pair is a pair (a, f) where a : G’ > G is a group homomorphism and 
f : A— A’ is a Z-homomorphism such that 


f((aa')a) = a’ f(a) 


for all vw € G’ anda € A. 
(Cees e 


a 


/ 


Let ((Pairs*)) denote the category whose objects are pairs (G, A) and whose morphisms are 
cocompatible (a, f). 

Define a compatible pair to be a pair (a, f) where a : G > G’ is a group homomorphism 
and g: A — A’ isa Zhomomorphism such that 


f(wa) = (ax) f(a) 
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for all x EG. 
Ce 


I 


AA 


Let ((Pairs)) denote the category whose objects are ordered pairs (G, A) and whose mor- 
phisms are compatible (a, f). 


Given a cocompatible pair, let P’ be a G’-projective resolution of Z and P be a G- 
projective resolution of Z. By the Comparison Theorem |25.2.4| there is a chain map T(q) : 
P' + P induced by the map lz: Z + Z and a, unique up to homotopy. Define 


C"(G, A) = Homza(Pr, A) = Homza(P", A’) = C'(G", A’) 
pre fopor(a)”. 


Similarly, for a compatible pair, there is a chain map T(a) : P + P’ induced by 1z: Z > Z 
and a; we get a map 


TG) & 7 ; C,,(G, A) =P, 2G A- Pe 2a Al = C,(G', A’) 
These maps descend to cohomology and homology, respectively. 


Definition 25.11.2: Define the maps below using the (co)compatible pairs shown. 


Name Map onG | Map on M Map 
Restriction | i:S3G | MEM | Rest)5: H"(G,M) > H"(S,M) 
Corestriction | i:5—4G M=>M Corgg : Hn(S,M) > An(G,M) 
Inflation | q:G—>G/S|MeM® | Infeo: H°(G/S,M*) > H"(G,M) 
Conjugation | oa gog! | gtmum H"(G, M) > H"(G, M) 


For inflation, we require that S < G (S be a normal subgroup of G). 


Proposition 25.11.3: change-group-conjugation The conjugation Map A"(G, M) —> A"(G, M) is 
the identity. 


This is important because when we are defining maps between different cohomology 
groups, we can be assured that conjugation won’t change it, i.e. we have a canonical map. 


Proof. For n = 0 this is the identity map M° + M®. Since the conjugation H"(G,M) > 
H"(G, M) is a map of cohomological functors, and the identity map H"(G, MW) > H"(G, M) 
is also a map of cohomological functors, and they agree for n = 0, by Theorem 2) 
they must be equal for all n. 
Alternatively, use dimension shifting. 
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11.2 Extending maps to Tate cohomology 


corestriction Right now Res” is only defined on cohomology and Cor,, is only defined on homology. 
We would like to define them on Tate cohomology. 


Proposition 25.11.4: prextendto-tate Let G be a finite group. The maps Res” and Cor, can 
be defined on Tate cohomology, such that the definitions for H7 agree with the original 
definitions on cohomology and homology for n > 0 and n < —1, respectively, and such that 
Res and Cor are natural transformations compatible with forming the long exact sequence 
in homology and cohomology from a short exact sequence. Moreover, Res” and Cor, satisfy 
the following properties. 


1. Corgg: HP(S,M) + H}(G, M) is the map Ne/g : M5/NsM > M°/NeM. 


2. Resqis : Hp'(G,M) > H7'(S,M) is the map Cg/s : nepM/IgM > v,M/IsM, where 
C@/s is the conorm map defined by 


Cays(a) = Ds i 


where {t;} is a left transversal of G/S. (Equivalently, let {t;} be a right transversal 
and let Cgyg(a) := ¥; tia.”) 


3. Corgig : Hp?(S,M) > H7?(G,M) is the natural map S?? + G*>. (See Proposi- 
tion |25.8.3}) 

Proof. First, the construction. We will use Theorem |25.5.2| Let x be the class of coinduced 

ZG-modules. Note that the category of ZG-modules has enough coinduced ZG-modules, by 

Proposition |25.9.8| Note that {H7(S,es)} and {H7#(G,e)} are cohomological 0-functors on 


the category of ZG-modules, with respect to x (by Ms, we mean think of M as a S-module). 
Indeed, any coinduced module for G is coinduced for S by Proposition |25.8.6}!° Since 


Res¢jg: M°/NgM > M®/NsM, Cory! : ysM/IsM — noM/IcM 


are natural transformations, Theorem 1) applies to give unique morphisms Res and 
Cor extending Ng gs. (They agree in cohomology and homology with the original definitions 
by uniqueness in Theorem )). 

Alternatively, we can extend the definitions of Res and Cor using dimension shifting 
(which is simpler, really)." 


°To see this, note t15 = teS iff ty ‘te € S, iff Sty?’ = Sty’. 

10Note this would fail if we take x to be the class of ZG-injective modules, as ZG-injective modules are 
not necessarily ZS-injective. 

"Alternatively, we can construct Cor” explicitly as the map 
Shapiro 


H"(S,M) H"(G, Coindg M) > H"(G, M) 


where the last map is the change of group map induced by G & G and Coindg M — M given by dG 
>, tiv(t;'), for some transversal {t;} for S in G. This is just the norm map in dimension 0. 
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We now calculate the maps using dimension shifting. 


1. Use the short exact sequence 0 > M’ > M* = ZG ®z M > M — 0 from Proposi- 
tion |25.9.8]to get the vertical isomorphisms in the diagram on the left below. (Note as 
before that M* is both G and S-(co)induced.) 


Corz! 
Hz'(S,M) —5 H7(G, M) n~M/IsM —> noM/IeM 


sls sls wstiee) |= no(ion) | 


0 
Corg/g 


The left-hand diagram gives the right-hand diagram, after noting that 6 is the map in 
the snake lemma in the proof of Theorem |25.9.3} From the right-hand diagram it is 
clear that the bottom map has to be Ng gs, because Ngjg 0 Ns = Na. 


2. From 03 M > M* 4 M*/M — 0 we get the commutative diagrams 


=I 
Resajs 


H7'(G, M*/M) —5 H7(S,M*/M) — Hz(G, M*/M) —> H7"(S, M*/M) 


sls |= not | sor | 
Res? 


HAS. M). M?/NgM ————> M°/NsM. 
From Ng = Ng 0 Ce/s, the top map has to be C@/s. 


3. Recall the isomorphism H,(G?>, Z) ~ G?> was defined using the horizontal maps below. 


Hy(S,Z) 24 Ho(S, Is) —= Is/I3 —> S/S" 


Lom [om | 


H,(G, Z) 4 Ay(G, Ic) —= Ie/I, —+ G/G 


The left square commutes by functoriality of Cor and the right rectangle commutes by 
tracing the map in Proposition |25.8.3 


11.3 Further properties 


Theorem 25.11.5: comes Suppose H is a subgroup of G of finite index. Then Cor” o Res” is 
multiplication by [G : H]. 


Proof. In degree 0, we have Cor? o Res” = [G : H] because Nex is just multiplication by 
[G : H] on M®. As in the proof of Proposition |25.11.3} the general case then follows from 
either Theorem |25.5.2| or dimension shifting. 


Corollary 25.11.6: hn-torsion 
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1. If G is finite, then |G|H”"(G, M) = 1 for any n > 0. 
2. If G is finite and M is finitely generated as an abelian group, then H"(G, /) is finite. 
Proof. 
1. By Theorem [25.11.5} 
H"(G, M) 28 H"(1, M) 3 H"(G, M) 
is multiplication by |G]. But H"(1,M) =0. 


2. By the explicit description of H"(G, M) using the bar resolution, H"(G, M) is finitely 
generated. By item 1 it has finite exponent, so it must be finite. 


Corollary 25.11.72 es-inj-p-prim Let G be a finite group and G, its p-SSG. For any G-module 
M, the map 
Res” : H"(G, M) > H"(G,, M) 


is injective on the p-primary component. 


Proof. Suppose that « € ker(Res). Then [G : G,]z = Coro Res(a) = 0. Since the order of x 
is a power of p but pf [G: G,], we get that x = 0. 


Corollary 25.11.8: corati-cp-o If H(G,, A) = 0 for all primes p then H7(G, A) = 0. 
We will also need to know how restriction and corestriction affect cup products. 
Proposition 25.11.9: :escup The following hold. 
1. Res(a U y) = Res(x) U Res(y). 
2. Cor(a# U Res(y)) = Cor(x) U y. 


Proof. See Cartan-Eilenberg [CE56], Chapter 12, or Atiyah-Wall in Cassels-Frohlich [CF69]}, 
p. LO7. 


11.4 Inflation-restriction exact sequence 


Proposition 25.11.10: intateresirice Suppose H << G, A is a G-module, and n > 0. If 
H’(H, A) = 0 for all i with 0 <i <r, then 


0 H"(G/H, A") 2s H"(G, A) 2 H’"(H, A) 
is exact. 
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Proof. We first prove the case r = 1. We show the following. 


1. ResoInf = 0: Change of group is functorial (easy to see from the definition), so 
Res o Inf is induced by the maps G/H + G < H and M¥ © M & M. The first map 
is 0 so Reso Inf = 0. 


2. Inf is injective: Suppose f : G/H — A? is a cocycle such that Inf([f]) = 0. Note 
Inf ([f]) = [fop] where p : G + G/H is the projection. Inf([f]) = 0 means f(s) = sa—a 
for some a € A. Since f is constant on cosets, sa — a = sta — a for all t € H, giving 
ta=a, anda e€ A”. Thus [f] = 0 in H1(G/H, A”) =0. 


3. ker(Res) C im(Inf): Suppose f : G > A is a cocycle such that [f] € ker(Res). Since 
Res[f] = [f 07], this means f(t) = ta — a for some a € A and all t € H. Define the 
coboundary g : G > A by g(s) = sa—a for all s € G; let f; = f —g; we have [fi] = [/]. 


Now f; = 0 on H, and by definition of cocycle, 


filst) = fils) + si). 


Letting t range over H, we get that f,(st) = f(s), ie. f is constant on cosets of H. 
Letting s € H we have f(st) = sf(t), so im(f) is invariant under H. Thus f descends 
to f: G/H > A", ie. f € im(Inf). 


Now we proceed by induction. Suppose the proposition holds for r—1. By dimension-shifting 


(Proposition |25.9.8), the exact sequence 
aqundres-ctaril) — A —> A* => A*/A —> 0 (25.7) 


with A* coinduced gives 0"~! : Ht-1(G,A*/A) = Hi(G,A). We now show there is a 
commutative diagram 


0 — H’-1(G/H, (A*/A)#®) B35 Hr-1(G, At /A) 22S Ar-H, A*/A) 


lors lon [on 


0 ——_> H"(G/H, A¥) —™" _., y"(G, A) —2 A’ (H, A). 


where all the vertical arrows are isomorphisms. We already know this for the middle arrow. 
Since A* is G-coinduced, it is H-coinduced (Proposition [25.8.6), so the right vertical 
arrow is an isomorphism. 
Since H'(H, A) = 0, taking cohomology of gives the exact sequence 


0— A® = (A*)? = (A*/A)? = 0. 
Recall A* = Hom(Z/[G], A), so (A*)” = Hom(Z[G/H], A) is G/H-coinduced. Thus we get 


the left vertical arrow is an isomorphism. 
By (cohomological) functoriality of Inf and Res, the diagram commutes. 
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11.5 Transfer 


hom-transfer Especially important for our purposes will be the restriction map on the first ho- 
mology group. 


Definition 25.11.11: The map Vc¢_.s defined by the diagram below 


H,(G,Z) ——G/G’ 


|e | vous 


His, 2) 8/3" 
is called the transfer or Verlagerung. 
(The map Res defined on Tate cohomology in Section also gives a map on homology.) 


Proposition 25.11.12: prcompute-transter Let G be a group and S be a subgroup of finite index. 
The transfer is given by the following: Let {l,,...,/,} be a left transversal of S in G. Then 


Resi(g) = [J 9S" 
i=l 


where the g; € S are such that gl; = [,()9n(s) for some permutation 7 € S,,. 


Proof. By functoriality of Res we have the commutative diagram (cf. Proposition |25.8.3) 


H,(G,Z) 2+ Ho(G, Iz) —= Ie/I2 
Bs | Reo=ce /s |e /8 
O1 


A, (S, Z) —> H(S, Ig) ——Ig¢/IsIe 


an ] 


Hy( 8,15) Is / 12 


where the top two 0,’s are from the exact sequence 0 > Ig > ZG > Z > 0, the bottom 0, 
is from the exact sequence 0 > Iy — ZH — Z — 0, and the lower right square is induced 
by the inclusion [7 <> Ig. Replacing H, with G?», we get 


Ce sig. 


|voos |ecis 


S/S’ —+I[¢/Isle 


NS J 


T/T 
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Given g € G/G’, it maps to g — 1 in Ig/I%. We have 


n n n n 


Cejs(9g—-1) = SoU (g-) = alr ag—l = > (Gi -DIF ry =S\(gi-1) (mod IsIg). 


i=1 i=1 i=1 i=1 


The inverse image of this in S/S” is T]"_, g:S’, as needed. 

Theorem 25.11.13: transtero Let G be a finite group. Then the transfer map 
VG? > (G)” 

is Zero. 


Proof. See Neukirch, [Neu99} V1.7.6]. The proof uses the computation in Proposition|25.11.12 


This will be important when we study the Hilbert class field. 


12 Cohomology of cyclic groups 


cyclicgroups The cohomology of cyclic groups is especially easy to understand, and will be 
very useful to us: when L/K is an unramified extension of local fields, the Galois group 
G(L/K) = G(I/k) is cyclic. 


Theorem 25.12.1: is.42 Let G be acyclic group and x a generator. Let y, € Hom(G, Q/Z) = 
Hi(G,Q/Z) be the homomorphism sending zx to at Let 6: Hi(G,Q/Z) > H2(G,Z) be 
the diagonal map from the exact sequence 0 > Z > Q > Q/Z — 0. The map eUdx, gives 
an isomorphism 


[as 


H7(G,M) > Hy(G, M) 


for all G-modules M and r € Z. 
Hence for all n € Z, 


where D is multiplication by «—1 . 


Proof. Since Q is a divisible group, so is H"(G,Q), by looking at the description of H” in 
terms of cocycles (Section |7). Hence 6 : H}(G,Q/Z) + HZ(G,Z) is an isomorphism and 
OX is a generator of H2,(G, Z). 

The short exact sequence 0 > Ig + ZG — Z — 0 splits because G is cyclic: 


0— Ig == ZG ——- Z=——- 0 
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where € oace Qg9) = Vyec 4. Now ZG has trivial Tate cohomology by Proposition 25.9.7, 
so the diagonal maps in either direction are isomorphisms: 


H3(G,Z) + H4(G, Ie) “4 H2(G,Z). 


Thus we can write dy, = 6°6'c for a generator c of H®(G,Z) = Z/|G|Z. Then by Theo- 
rem |25.10.1[4), 
bU dx, = bU Sle = 5°6'(b Uc). 


eUc 


It suffices to show that the map H7.(G, M) —> H7(G, M) is an isomorphism. But this map 

is just multiplication by c for r = 0, so it is multiplication by c for all r. Now by Proposi- 

tion [25.11.6] (true for r > 0 and hence true for all r by dimension-shifting) |G|H7(G, M) = 0. 

As c is a generator of Z/|G|Z it is relatively prime to |G|; hence multiplication by c is an 

isomorphism on H!.(G,M). This shows the isomorphism H'.(G,M) — H1*?(G, M). 
For the second part, note Hp'(G, A) = yA/DA and H9(G, A) = A&/NA. 


Corollary 25.12.2: comexact-nex Let G be a finite cyclic group. Suppose that 1 > A— B—- 
C' — 1 is an exact sequence of G-modules. Then there is an exact hexagon 


exact-hexagon HUG, A) ati H&G, B) (25.8) 


Proof. We have Hi(G, A) = H(G, A). 


12.1 Herbrand quotient 


herbrand 


Definition 25.12.3: Let G be a finite cyclic group and A a finite G-module. Define the 
Herbrand quotient to be 


|H7"(G, A)| 


h(A) = h(G, A) = [H2="(G, A)| 


for any n. 


This is well-defined by Theorem |25.12.1 


The following key properties of the Herbrand quotient will help us in computations. 
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Proposition 25.12.4: herbranai Let G be a finite cyclic group. The Herbrand quotient sat- 
isfies the following. 


1. If A is a finite G-module, then h(G, A) = 1. 


2. (h is an Euler-Poincaré function) If 1 — A —- B— C — 1 is an exact sequence of 
G-modules, then 


h(G, B) = h(G, A)h(G,C). 
(If two of these are defined then the other is defined.) 
3. If G acts trivially on Z, then h(G, Z) = |G. 
4. If f : A— B has finite kernel and cokernel, then h(A) = h(B). 


Proof. 1. We use Theorem |25.12.1/to calculate the quotient. We have the exact sequences 


0 > vA -A- “sn A+ 0 ——> ker D — A ——> DA ——+ 0. 


AG 
Hence 
|NAllwA| = |A| = |AS||DAI, 
giving 
|H"(G,A)| = |wA/DA] = |AC/NA| = |H°(G, A)]. 
2. Keeping the notation in the hexagon |25.8| we have 


|H°(G; A)| 


H°(G, A) = | ker f,| E ~Tker fil 


We can similarly calculate the other quantities to get the result. 
3. Let |G] =n, and [n] denote multiplication by n. We have 


0 G 
nG,2) - MGB _ (ZINA _ C/n2\ _ |e 


_— = = = |G). 
Hp '(G,Z)| ~ IwZ/ToZ| ~ [kerf] 1 


4. The exact sequence 1 — ker f — A — B - coker f — 1 gives h(G,ker f)h(G, B) = 
h(G, A)h(G, coker f) (split the exact sequence into 2 short exact sequences and use 
part 2). The result now follows from part 1. 
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13. Tate’s Theorem 
tate-thm-section Our main goal in this section is to prove the following. 


Theorem 25.13.1 (Tate’s Theorem): tate-inm Let G be a finite group and M be a G-module. 
Suppose that for all subgroups H CG, 


1. H'(H,M) =0 and 
2. H?(H, M) is cyclic of order |H]. 
Then given a generator u € H?(G, M), there is an isomorphism 
HY(G,Z) 4 HE? (G,M) 
for all r. 


This is the main application of group cohomology to class field theory, as this will be the 
inverse of the Artin map: for instance, in local class field theory we have 


Hy? (G(L/K),Z) = G(L/K)” 
H}(G(L/K), L*) = (L*)C29 { Nmzjx(L*) = K*/ Nmzx(L*). 


The conditions of Tate’s Theorem may seem unmotivated, but keep in mind that they are 
basically the key conditions satisfied in the number-theoretic setting, when G is taken to be 
a Galois group and M is taken to be a field (or idele group). 

Class field theory was initially proved without group cohomology, but group cohomology 
gives a much nicer way to organize and abstract the proof. This theorem is a key part 
of that abstraction: isolating the key number-theoretic conditions that result in the Artin 
isomorphism. In proving both local and global class field theory, we will spend significant 
time showing that the hypothesis of Tate’s Theorem holds. (The key difference in local and 
global class field theory is that we put in different things for /.) 


Proof. Serre |Ser79], Section IX.8. 


14 Profinite groups 


profinite-cohom In this section we study the cohomology groups when G is a profinite group. In 
this case topology becomes important. We will apply the results when G is an infinite Galois 
group. 

We find that we have two ways of interpreting the resulting cohomology groups: 


1. Imitate the previous construction but work in the category of topological G-modules 
instead. Le. feed in “category of topological groups” into our cohomology functor. 
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2. Take the direct limit over finite quotients of G. 


Definition 25.14.1: tp-<moa A topological G-module is a G-module that is a topological 
group, and such that the map 


yp:GxMoM 
(g,m) + gm 


is continuous. 
We will always give M the discrete topology, so this is equivalent to the following condition: 


M= U Fae 


H open subgroup of G 


Indeed, because M has the discrete topology, for the action to be continuous, 7g(y~!(m)) 
must be open, where 7g : G x M — G is the projection. This is just the stabilizer of m, 
so the stabilizer of m must contain an open subgroup of G. Hence, every m € M must be 
contained in some M”., 

We define H"(G,M) as before, but now in the category of topological G-modules, i.e. 
we replace every instance of Home with Hom@™, since in this category the morphisms are 
continuous G-homomorphisms. Note that the category of discrete G-modules has enough 
injectives. 


Theorem 25.14.2: profinitetimz Let G be a profinite group. We have 
H"(G, M) = lim H"(G/S, M*) 
where the limit is over open normal subgroups S and the maps are the inflation maps 


Inf” : H"(G/S,M*) 4 H"(G/T,M"), SDT. 


Proof. Milne |Mil08}, IT.4.2. 


We have a similar result if we take the limit over M. 


Proposition 25.14.32  prtrcommutes-im Let G be a profinite group and suppose M = lim H"(G, M;) 
is a discrete G-module, and each M; injects into M. Then 


H"(G, M) = lim H"(G, M)). 


Proof. Milne |Mil08}, I.4.4. 
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15 Nonabelian cohomology 


nonabelian-cohom In this section we define cohomology H"(G,A) when A is non-abelian. (It was 
okay for G to be non-abelian because we saw it in the guise of ZG, but we needed A to be in 
an abelian category.) The cohomological construction fails and we instead imitate the results 
of Theorem (The description of H' and H? in Theorem are useful because 
derivations and factor sets are used to classify a lot of things.) 

We will only be able to get a “piece” of the long exact sequence. Cohomology also lacks 
a lot of structure: we speak not of cohomology groups, because they are now only pointed 
sets. We write A multiplicatively, as is the convention for nonabelian groups. 


Definition 25.15.1: The category of pointed sets is the category whose objects are pairs 
(A,a), where A is a set and a € A, and such that a morphism (A,a) — (B,}) is a function 
A — B taking a to b. 

The kernel of f : (A,a) — (B,) is f~*(b). Thus we can define an exact sequence of 
pointed sets. 


We now define the cohomology (pointed) sets. These will coincide with the definition in 
the abelian case by Theorem |25.7.2| except we only retain the structure of a pointed set. 


Definition 25.15.2: df:nonabelian-cocycles Let G be a group and A a group with G-action. 


1. Define 
H°(G, A) = A° := {a € A: sa=a for all se G}. 


The distinguished element is 1. 
2. Define a 1-cocycle to be a map d: G > A such that 
day) = d(x) - ad(y) 


and let Der(G, A) be the set of 1-cocycles. Two cocycles d;, dz are cohomologous if 
there exists a € A so that!” 


do(x) =a~'- d,(x) - aa. 


Note this is an equivalence relation; define H'(G, A) to be the pointed set of 1-cocycles 
modulo equivalence. The distinguished element is the unit cocycle d(x) = 1. 


For an exact sequence of non-abelian G-modules 
13> ASBA051 


with 7(A) < B, define the coboundary operator 6 : H°(G,C) — H'(G, A) as follows: 
given c € G@, choose any b € p-'(c) and set 


6(c) = d where d(s) =i-'(b"'s(b)). 


The analogue in the abelian case was do(x) = —a + d(x) + xa. 


309 


Number Theory, §25.15 


If furthermore i(A) is in the center of B (so A is abelian), define A : H'(G,C) > 
H?(G, A) as follows: for d. € H'(G,C), choose dy such that p,d, = d,, and set 


[A(d)] (x, y) = do(s) - s(do(t)) - dp(st)~*. 


Proof of well-definedness. Note the coboundary operator is defined by imitating the con- 
struction in the snake lemma. 


ae. - 
A eins meee 2B ——— sO b ————_ 3¢ 
«| a: 
Der(G, A) ——> Der(G, B) (s+ 1*(b-'s(b))) —+ (s + b's(6)) 


We need to show that s +> b~1s(b) is actually a cocycle; its image is in A because s(b) = 071 
(mod 7(A)) by exactness; show that the cohomology class is independent of the choice of b. 

The second part is similar. Everything is easy to prove so we omit it. See Serre [Ser79}, 
Appendix to Chapter VII. 


Theorem 25.15.3 (Exact sequence in nonabelian cohomology): thm:nonabetiantes Let 1 + A + 
B *, C + 1 be an exact sequence of non-abelian G-modules. Then the following is exact. 


1 — H°(G, A) 2+ H°(G, B) 5 H°(G,C) 4 H\(G, A) > H1(G, B) 24 H1G,C) 
a 


H2(G, A) 


(with the last map present if A is in the center of B). 


310 


Chapter 26 


Introduction to Galois cohomology 


galois-cohomology-ch We will apply group (co)homology as follows: Take a Galois extension L/K 
and let G := G(L/K). Take as a G-module a multiplicative or additive subgroup S' of L. 
The special case that G is cyclic will come up often, since if L/K is an unramified extension 
of local fields, then G is cyclic. Furthermore, the norm map Ng has a natural interpretation: 


1. If S C L* then fora eS, 


Ne(a) = Jf o(a) = Nmz x(a). 


o€G 
2. If S C Lt then forae S, 


Ne(a) = S5 o(a) = trzx(a). 


o€G 


In Section |2} we give an application to Kummer theory (characterizing certain abelian ex- 
tensions L of K in terms of L*" kK). Kummer theory will allow us to prove the linear 
independence of nth roots. 

Finally, we give two interpretations of Galois cohomology groups. 


1. H'(G(L/K), Aut(V)) parameterizes algebraic structures defined over K that become 
isomorphic in L (Section B). This is called descent. 


2. H*(G(L/K), L*) parameterizes classes of K-algebras “split” over L (Section|4), i.e. it 
is the Brauer group. 
1 Basic results 
galois-cohomology We prove two fundamental theorems on the cohomology of L* and L*. 
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Theorem 26.1.1 (Hilbert’s Theorem 90): n90 ({) Let L/K be a Galois extension with Galois 
group G. Then 
AG i) S11) 


Moreover, if G = (a) is cyclic and u € L*, then the following are equivalent. 
1. Nmzx«(u) = 1. 
2. There exists v € L* such that u = o(v)v7t. 
We will often abbreviate H'(G(L/K), L*) as H'(L/K). 


Proof. First suppose G is finite. Let c: G > L* be a 1-cocycle; we have c,, = o(c,)Cg- 
Consider the function 
be) s= 7 Gre). 
TEG 
By linear independence of the characters T € G, b is not identically zero; hence there exists 
e € L* so that b(e) 4 0. Operating by o on both sides and using the cocycle condition gives 


eanoopt7(b(e)) = > o(c,)(or)(e) = >> Cor€g-1(oT)(e) = 5 'b(e) (26.1) 


TEG TEG 


and cs = b(e)a(b(e))~+, so c is a coboundary. 
The infinite case follows from the finite case and Theorem [25]25.14.2 
For the second part, note that H'(G, L*) = ker(N)/im(D) = 0 gives ker(N) = im(D). 
Here N is the norm map Nm; x and D is the map o — 1, i.e. the map v > aoe 


Next we think of L as an additive group. 
Theorem 26.1.2: 1+0 Let L/K be a finite Galois extension. Then 
HG St, 7 S0, 


Proof. From the normal basis theorem |12/12.4.3| there exists a € L such that {ca : 0 € G} 
is a basis for L over kK. We get that K|G] = L as G-modules by the map 


> Ago > S- Agod. 


o€G o€G 


Since K[G] = Ind? (Kk), 
H'(G,L*) = H"({1}, K) =0 


by Shapiro’s Lemma]25.8.7 
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2 Kummer theory 


kummer We use Galois cohomology to prove the following. 


Theorem 26.2.1 (Kummer theory): kummer-theorem Suppose K is a field containing a primitive 
nth root of 1. Then there is a bijection between 


1. Finite abelian extensions of K of exponent dividing n (i.e. for any o in the Galois 
group G(L/K), 0” = 1). 


2. Subgroups of K* containing AK" as a subgroup of finite index (i.e. subgroups of 
Kepke™). 


This correspondence is given by 


LH K* nN L*" 
K[B™] HB. 
Moreover, 
degree-order| L 7 iq = [k~* ‘a Le 7 k*| (26.2) 


(Note in the reverse map, which nth roots we take doesn’t matter because K contains nth 
roots of unity.) 
In the course of proving this theorem, we will show the following useful proposition. 


Proposition 26.2.2: pr:kummerchar Let K be a field containing a primitive nth root of 1 and 
L/K an abelian extension with Galois group G. Then there is a natural isomorphism 


K* 0 L*"/K*" & A"(G, un) = Hom(G, pin) 


1 
o (an) 
at Ct a i FI 
an 


In particular, there is a natural isomorphism 

K*/K*" © H'(G(K*/K), pin) = Hom(G(K*/K), ttn). 
Proof. Let G = G(L/K), and denote the forward map by B(L) = K*M L*". The key 
step is showing that (26.2) holds; we do this by interpreting K* 9 LX” as a Oth cohomology 


1 


module. The inclusions L D> K(B(L)") and B(K(Bx)) D B are easily seen to hold (Step 
2), so (26.2) will give that equality holds (Steps 3-4). 


Step 1: By Theorem |25]25.4.6| the short exact sequence of G-modules 
—— | 
cree” 


1 > pn > L* ——> L*" > 1 
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induces the long exact sequence 
1 — H°(G, u,) @ H°(G,L*) 4 H°(G, L*") = BG, un) > A'(G, L*®) 3 
We need not go further because Hilbert’s Theorem 90 (Theorem [26.1.1) tells us 
HG) Bi) 1, 


Next, note that H°(G, H) is simply the subgroup of H fixed by G, and that the subfield of 
L fixed by G is K. As pu, C K, G acts trivially on yw, and H'(G,u,) = Hom(G, un) by 
Corollary |2525.7.3} The sequence becomes 


ij R* KA = Honig) 1, 


giving an isomorphism 
K* 7b" 7k" = Hom(G, pn). 


o(am) 


The map is 01(a) = (o rH ae 


i as shown by tracing through the construction in Theo- 
rem |25}25.3.3| This proves Proposition |26.2.2 


Te a 
jie a » L* pea” L* ae _ are a 
dy dy dy 
pa i 1 at 
Der(G, pty) —> Der(G, L*) (0 = “or ) i (o = “or 
We claim that | Hom(G, ju,)| = |G]. Indeed, by the structure theorem for abelian groups, 
G decomposes as (Z/n,Z) x-++-x (Z/nmZ) where n1,...,%m | n. To choose a homomorphism 
for G means choosing images for the generators of Z/n,Z,...,Z/mmZ; there are n1,...,%m 
possibilities, respectively, for a total of |G]. 
Then 


[L: K] =|G(L/K)| = [KX nL": K*]. 


This shows (26.2). 


Step 2: Next note the following two inclusions. 

1. K[B(L)*] C L: Anything in (K* M L*")* is in the form (8")* and hence in L. 

2. B(K[Bx]) D> B: Anything in B is in the form (b)" and hence in KX 9 K(Br)*”. 
Step 3: We show that K[B(L)=] = L. By the inclusions in step 2, 


(26.2) 


[L : K] > [K[B(L)*] : K] = [B(K[B(L)*)) : K*] > [B(L) : K*. 
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But [LZ : K] = [B(L) : K*] by (26.2), so equality holds everywhere. The first equality gives 
the conclusion. 


Step 4: We show that B(K[Bx]) = B. We apply step 1 with L = K[Bz] to get the 
isomorphism 


~ 


B(L) = K* nN L*"/K*" — Hom(G, pn) 


ar [o e) 


Now B C B(L) gets mapped to a subgroup H’ C Hom(G,p,), which can be identified 
with Hom(G/H, u,)!. But as the a generate L over K and the fixed field of G is K, 
nen kerh = 1. Thus H = {1}. Hence |B(L)| = |G| = |BI, and B = B(L). 


Corollary 26.2.3 (nth roots are linearly independent): Let S be a set of nonzero integers 
so that ¢ is not a perfect nth power for any distinct a,b € S. Then the elements 


v/s, ses 
are linearly independent over Q. 


Proof. Step 1: It suffices to show that for distinct primes p,,...,p,, we have 


roote-right-degree|Q(2/D1,..., */Dk) : Q] =n". (26.3) 


Then a basis for this extension over Q is formed by taking products of basis elements for the 


QC y/P;): 
basis-sqrt { Vpt ---pR :0< a; < n} (26.4) 


However, the radicands are exactly the representatives of elements in Q* /Q*". The elements 
of S' are all represented by distinct elements of modulo Q*, so the theorem will follow. 
(To deal with s € S negative, note if s is negative then %/s is a not in R.) 

We want to use Kummer theory to conclude (26.3). However, Q only has square roots 
of unity (+1), so we have to consider all other roots separately. We may as well assume 2 | n. 


'The subgroups of G are in bijective correspondence with the subgroups of Hom(G,, w,) via the map 


H *s {h € Hom(G, pin) : H C ker hh} & Hom(G/H, jin) 


() kerh & # 
heH’ 


Indeed, clearly U(®(H)) > H, and we have equality since for every g € G\H we can find h € Hom(G, pu,) 
with kernel containing H, so that h(g) 4 1. Since Hom(G, j1,,) & G, they have the same number of subgroups, 
and this is a bijection. 
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Step 2: We first show 
roots-indpt-indext[Q(4/P1, ry /Pk) i Q) = OF (26.5) 
Let B be the subgroup of Q* generated by pj,...,pz and Q*?. By Theorem |26.2.1| 
(Q(B?) : Q] =[B: 7] = %, 
as needed. 


Step 3: We now adjoin nth roots of unity such that we can apply Kummer theory for nth 
roots. Let N be a positive integers such that n | N and Q(,/pi,..-,./Pr) © Q(¢n) (every 
quadratic extension is contained in a cyclotomic extension; we can take N = 4p; --- ppn). 


However, if we look at K := Q(¢y), what if elements that aren’t nth powers in Q become 
nth powers? Fortunately, this doesn’t happen for n 4 2. We show that for even n 4 2 and 
m € Q not a perfect }th power, 


root-not-in-cyclotomic wm Z Q (Cn) . (26 6) 


By taking roots, we may assume that m is not a perfect dth power for any d | n. 

Note L := Q(%/m, ¢,) is a Galois extension of Q since it is the splitting field of X” — m. 
Note X" —m is irreducible over Q because the constant term of any proper factor must be in 
the form m ¢ Q where 0 < j <n. Hence there exists T € G(L/Q) sending %/m to ¢, %/m. 
Let o € G(L/Q) denote complex conjugation. Then 


or( am) = o(G. 9m) = Gt Vm 
roam) = r( sm) = Gm. 


Hence G(L/Q) is not abelian. Since all cyclotomic extensions are abelian, L cannot be 
contained in an abelian extension, giving (26.6). 
Let C be the subgroup of Q(¢y)* generated by \/pi,..., ,/px and Q(¢nv)*2. We showed 


7 
above that %/m ¢ Q(¢y)*? for any m not a perfect 2th power so [C : Q(¢y)*?] = ee By 
Kummer Theory, 


(lc, Pi ++ PR)  OGw)] = (C2) : K] = [0  OGw)*4] = (8) 

Since Q(/Pir---+ Br) QC) we get 
roots-inaptindex2[Q( 2/Diy ---, 9/Pw)  Q(/Pay--- 5 /Px)] = (S)" (26.7) 
Combining and gives (26.3), as needed. 
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3 Nonabelian Galois cohomology 


sec:nonabelian-galois-cohom Because of the definition of H+ in Section [25]15} we find that we can often 
interpret H'(G(L/K), A) as parameterizing certain algebraic structures, specifically a set of 
them defined over K that become isomorphic in L. (This is known as descent because it 
answers the question, how many ways can an algebraic structure (or in general, a variety) 
“descend” from L to K’?) In general, 


H'(G(L/K), {automorphisms preserving V over K}) 
~ {K-isomorphism classes that are L-congruent to V }ea:descent (26.8) 


In this section, we will see several examples where A is an algebraic group. We could 
also take A to be an abelian variety (see Silverman [Sil86], Theorem X.2.2, for instance). 

In particular, we find in the next section that a special cohomology group classifies algebra 
structures over K: the Brauer group. 

First, we need the following nonabelian generalization of Hilbert’s Theorem 90 (26.1.1). 


Theorem 26.3.1 (Generalization of Hilbert’s Theorem 90): tm:gen-noo For any finite Galois 
extension L/K, letting G= G(L/K), 


MG, Gh, (f\)= AG, Sh b)) =1, 
Proof. As in Theorem }26.1.1| given a 1-cocycle c: G + GL,,(L), consider the function 


b: GL,(L) > M,(L) 
b(A) := S° ¢,7 (A). 


TEG 
Note that unlike in the proof of Theorem |26.1.1} we not only have to choose A to be nonzero, 
but also invertible. 
Also define 6 on L” in the same way: 


b: L™ 4 L” 
bai Sere): 
TEG 


We show that {b(x):x € LZ"} generate L” as a vector space over L.* Suppose a linear 
functional f : L" > L vanishes on all the b(x). Then for every a € L, 


0= f(d(ax)) = 0 fler(a)r(x)) =D) ra) F (err (x). 


TEG TEG 


By linear independence of characters, we get that all the coefficients of the 7(@) must be 0, 
i.e. f(c,-7(x)) for all c,,x. But c, € GL,(Z) is invertible, so f must vanish identically on 


?Note b is not a L-linear transformation; it is a K-linear transformation. 
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L”,. We’ve shown that every linear functional vanishing on {b(x)} vanishes on L”; therefore 


span, {b(x)} = L”. 
Thus we can choose x;,...,X, such that y; = b(x,;) form a basis for L” over L. Let A be 
the matrix sending the canonical basis e; to the x;. Then (note 7 acts trivially on the e;) 


b(A)e; = b(Ae;) = y; 


so b(A) is invertible. 
The the rest of the proof of Theorem |26.1.1]goes through: we have as in (26.1) that 


cy = b(A)o(b(A)) 7, 


ie. cis a coboundary. This shows H!(G,GL,(L)) = 1. 
For the second part, the exact sequence 


iS Siti GLA or 1 
gives the long exact sequence |25]25.15.3 


H°(G,GL,(L)) > HG, L*) —> H1(G, SL,(L)) —+ H1(G, GL,(L)) 


GL)" —_ 9 k* 


As the map on the left is surjective, we get H'(G,SL,(L)) = 1. 


We have now established when V is a vector space: all vector spaces that become 
isomorphic in ss = the same Tension to begin with so are isomorphic in K, so the right- 
hand side of (26.8) is {1}, and if V = A”, GL,,(Z) is the group of automorphisms preserving 
V over L, eat anes shows the right-hand side of ( (26.8) is {1}. We now extend 
this to oes algebraic senacuaTes 

To encode an algebraic structure, we consider vector spaces and tensors. 


Example 26.3.2: ex:tensors 
Let V be a finite-dimensional vector space. The space V®? @ V*®% encodes. .. 


Structure 
vectors 
linear functionals 
linear operators 
bilinear forms 
algebra structures 


Ry Ol rR} Ole |'s 
NO) DOP RR} OR 


We focus on the case p = 1, q = 2. Given a tensor >; v; ® f; @ gi € V @ V*®?, define a (not 
necessarily commutative or associative) algebra structure on V by 


CS >» filv)g (w)u 
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Conversely, any algebra structure can be encoded in this way: Take a basis {v;} for V and 
a dual basis f; for V*, and encode the structure by 1°; ;(u- vj) ® fi @ gj. 


Definition 26.3.3: Let V be a vector space over K and x € V®? @V*@4 be a tensor of type 
(p,q). Two pairs (V,) and (V’, 2’) are isomorphic if there is a K-linear isomorphism 


FeV 
such that f(z) = 2’. Here, f sends 
cacextend-tensorl1 @+ + @Lp@fi®-+-@fq f(r1)@--- f(tp)@(frof')®---@(faof*). (26.9) 


Given (V,2) defined over K, we can consider it over L by extending scalars; denote the 
resulting pair by (V, = V @x L, zz). 

We say that (V,x) and (V’, 2’) are L-isomorphic if (Vr, x ,) and (V;, 2’,) are isomorphic. 
Let Ey..(L/) denote the L-isomorphism classes of pairs that are K-equivalent to (V,«). If 
L/K is Galois, let s € G(L/K) act on v@a €V @xK L=V;, by s(u @x a) := Vv @x s(a) and 
let s act on Ay by conjugation: 

Fesafea, 


Theorem 26.3.4 (Descent for tensors): tim:aescent-tensors Let L/K be a Galois extension, G = 
G(L/K), and let Ay be the group of L-automorphisms of (Vz, x,). Define the map 


0: Ey.(L/K) + H'(G, Ar) 
(Viele (d:onftof?=ftogofog') 


where f : (Vz, az) > (Vi, 2/,) is any L-automorphism. Then @ is a bijection. 
Proof. We show the following. 


1. @ is well-defined. First, 0(V’, x’) is a cocycle as 


d(ot) = frlotft 0 = (fo fo) [o( frist Yom] = d(o) a(t)”. 


(See Definition }25]25.15.2|) Next, we show 6(V’,z’) does not depend on the choice of 
f: Let d;(a) = f-'o fo and d,(s) = g"'ogo'. Then 


dj(o) =g ‘ogo t= g"f(flofa)of go = (fg")ds(o)(fg 7)” 


so dy and d, are cohomologous. 


2. 0 is injective. Suppose 6(Vj, x) = 6(V3, 25). We can choose the isomorphisms f; and 
fo such that f;'f? = fo fZ for all o € G(L/K). Then (fof; ')” = fof,’ for all 
o €G(L/K), ie. fofy* is an isomorphism defined over K. Thus (Vj, 2) and (V3, 75) 
are K-isomorphic. 
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3. 6 is surjective. Let c, be a l-cocycle of G with values in Ay. Since Ay C GL(Vz), by 
Theorem |26.3.1|there exists f € GL(Vz) such that 


er ae 
Let f operate on V2? @ V*®4 as in (26.9) and let 2’ = f(r). Asa € Ve? @ Ve" and 
Co fixes K, we have 
o(z') = f(o(z)) = fe) = fee(a)= fiz) =e. 


Thus 2’ is rational over K (i.e. in Ve? @ V;-°"), and (V, 2’) maps to Cg. 


Note that since we always take an isomorphism V — V’, we can really consider all the 
vector spaces to be the “same,” and just vary the tensors x. If we consider V = V’, then we 
abbreviate f : (Vr,2,) > (V;,2,) by f: 2 2’. 


Example 26.3.5: We can use Galois cohomology to classify quadratic forms over a field K. 
Let ® be a quadratic form (which corresponds to a bilinear form and can be represented by 
a tensor of type (0,2)), and O,(®) be the orthogonal group of ®, i.e. linear transformations 
that preserve ®. Then H'(G(L/K),O ,(®)) classifies the quadratic forms over K that are 
L-isomorphic to ®. 


4 Brauer group 


brauer The Brauer group characterizes algebras over a field K. We already know a simple 
way of making algebras: just consider the algebra of n x n matrices, M,,(/). Thus, we will 
essentially “mod out” by these when constructing the group. 

As we will see, there is an isomorphism to a second cohomology group. Thus, we can apply 
results about algebras over K to Galois cohomology, or conversely, apply Galois cohomology 
to get information on algebras over K. 

First, we need some results from noncommutative algebra. We refer the reader to 
Cohn [Coh03], Chapter 5, or Milne [Mil08], Chapter IV.1-2, for the proofs. 


4.1 Background from noncommutative algebra 


Definition 26.4.1: An algebra over a field K is a ring A with K in its center®. Its 
dimension is the dimension of A as a K-vector space, denoted [A : K]. In this chapter we 
assume all algebras to be finite-dimensional as K -vector spaces. 

An algebra over K is 


1. simple if it has no proper two-sided ideals. 


3The center of a ring R is the set of elements commuting with all elements of R. 
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2. central if its center in Kk. 
An algebra is a division algebra if every nonzero element has an inverse. 


Example 26.4.2: The algebra of n x n matrices M,,(/) is a central simple algebra over 
ie 


Definition 26.4.3: Let A be an algebra over K. We use “A-module” to mean any finitely 
generated left A-module V; the map A > End(V) is called a representation of A. The 
module (or representation) is faithful if av = 0 for all v € V implies a = 0, ie. AO 
End(V) is injective. A module is simple if it doesn’t contain a proper A-submodule, and 
indecomposable if it is not the direct sum of two proper A-submodules. (Note that simple 
implies indecomposable, but not vice versa.) A module is semisimple if it is the direct sum 
of simple A-modules.* 
We say A is semisimple if it is semisimple as a module. 


We need some basic results from noncommutative algebra. 


Definition 26.4.4: Let B C A be a subalgebra. Define the centralizer of B to be the 
elements of A commuting with B: 


C(B) := {a € A: ab = ba for all b € B}. 
Theorem 26.4.5 (Double centralizer theorem): thm:doubie-centratizer Let A be a K-algebra, and 
V a faithful semisimple A-module. Consider A as a subalgebra of End«(V). Then 


C(C(A)) =A. 


Proof. Milne [Mil08], Theorem IV.1.3, or Etingof [Eti], Theorem 4.54. 


Theorem 26.4.6 (Wedderburn’s structure theorem): tnm:weaderburn An algebra A is semisimple 
iff it is isomorphic to the direct sum of matrix algebras over division algebras. 

If A is an algebra over an algebraically closed field K and K, then any semisimple algebra 
over K is isomorphic to a direct sum of matrix algebras over K. 


Proof. Milne |Mil08}, Theorem IV.1.15. 

For the second part, we need to show the only division algebra over an algebraically 
closed field K is K itself. Suppose D is a division algebra and a € D. As [D : K}] is 
finite-dimensional, A’(q@) is a finite extension of K. Hence a € K, giving D= K. 


‘Equivalently, the radical of A is trivial. If it is semisimple the factors in the decomposition are unique 
up to isomorphism (Jordan-Hélder). 
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Theorem 26.4.7 (Noether-Skolem theorem): thm:ncetherskoiem Let f,g : A + B be homomor- 
phisms, where A is a simple K-algebra and B is a central simple K-algebra. Then there 
exists b € B such that 


f(a) = b- g(a) -b- 


for alla € A, ie. f,g differ by an inner automorphism of B. 
In particular, taking A = B and g = 1, all automorphisms of a central simple K-algebra 
are inner (come from conjugation). In particular, this is true for M,(K). 


4.2 Central simple algebras and the Brauer group 


We now define the Brauer group. 


Definition 26.4.8: Let A and B be simple algebras over kK. We say A and B are similar 
and write A ~ B if 
A@K Mn(K) = B@xKMn(K) 


for some m,n. 
The Brauer group Brx is the set of similarity classes of central simple algebras over 
K, with multiplication defined by 


[A][B] = [A @x Bl. 


The Brauer group Bry x is the subgroup of classes of central simple algebras over AK that 
are split over L, i.e. such that A @x L is a matrix algebra. 


Proof (sketch) that this is a group. We need to check that... 


1. The tensor product of two central simple algebras is central simple. By Wedderburn’s 
Theorem [26.4.6] we can write the algebras as A = M,,(D) and B = M,,(D’), where 
D, D’ are division algebras. One can show A ®x D’ is simple; hence it equals M/,,(D”) 
for some D”. Then A @xK B= Mim(D") is simple. It is central because C(A @x B) = 
C(A) @x C(B) = K. 


2. “~” is an equivalence relation. If A~ Band B~ C, then A@®xK Mn(K) = Br @x 
M,(K), B@xK Mn (K) =C @x M,(K) for some m,n, n’,p. Then 


A@KMmn (K) = A@KMm(K)@xKMy(K) as COKMn(K)@xM,(K) = C@KMny(K). 


3. “~” is preserved under the operation ®. If A; @% Mm,(K) = Bi; @~K Mn,(K) for 
4 = 1,2, then A; @x Az @K Minym,(K) = Bi @x Bo @K Mri. (K). 


4. A has an inverse. Letting A°PP be the opposite algebra, we find that 


A®x APP = M,(K), n=([A: Ki]. 
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5. The operation is commutative and associative. This follows since tensor product is 
commutative and associative. 


By Wedderburn’s Structure Theorem [26.4.6] each (central) simple algebra is M,,(D) = 
M,(K) ®@x D for some (central) division algebra D, so every similarity class is represented by 
a central division algebra. Thus to determine the Brauer group it suffices to classify central 
division algebras. 


Example 26.4.9: We have 


Brg = {R, H} 


where H denotes the quaternions: the algebra with basis 1,i,j,k = ij and relations i? = 1, 
j° =1, and ij = —ji. 

Indeed, by Frobenius’s Theorem, the only finite-dimensional (associative) division alge- 
bras over R are R, C, and H, and only R and H have center equal to R. 


Proposition 26.4.10: For any algebraically closed field Kk, 
Brz = 0. 


Proof. By Wedderburn’s Theorem |26.4.6| all central simple algebras over K are M,,(/’) for 


some 7. 


4.3. Subfields and splitting of central simple algebras 


An important way of studying a central simple algebra is to look at its subfields. 


Theorem 26.4.11 (Double centralizer theorem, generalization): tnm:act-gen Let A be a central 
simple K-algebra and B be a simple K-subalgebra. Let C = C(B). Then C is simple, 
C(C) = A, and 

[B: K||C: kK] =|A: K]. 


Proof. See Milne |Mil08], Theorem IV.3.1. 


Corollary 26.4.12: Let A be central simple over A, and L be a subfield with K CLC A. 
Then the following are equivalent. 


1. b=C(L). 
2: (Arkh) = lhe kP. 


3. L is the maximal commutative K-subalgebra of A. 


Proof. Milne |Mil08}, Corollary IV.3.4. 
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The following describes the fields over which a central simple algebra splits. 


Corollary 26.4.13: corcsa-spit Let A be central simple over kK. A finite extension field M 
splits A iff there exists an algebra B ~ A containing M with [B : K] = [L: K]?. In 
particular, any subfield L of A of degree \/[A : K] splits A. 

If D is a divison algebra of degree n? over K, and L is a field of degree n over K 
(equivalently a maximal commutative subfield of D), then L splits D, i.e. D=M,,(L). 


Proof. Milne |Mil08}, [V.3.6, and 3.7. 


Theorem 26.4.14: tim-airspit Every central division algebra over K is split over some finite 
Galois extension L/K. Therefore 


Bre = Brg x = U Brg yes 
L/K finite Galois 


Proof. When K is perfect, this follows directly from Corollary |26.4.13| The general case 
requires a separate argument; see Milne |Mil08), IV.3.10. 


Similar to the commutative case, we can define a valuation on division algebras. 


Proposition 26.4.15: praiv-aigvai Let D be a division algebra of rank n? over a local field 
kK. Then D admits a discrete valuation extending the valuation on K, such that for any 

€ (0,1), ||z||5 := a’ defines a norm on D. The set of integral elements {x : v(x) > 0} is 
a subring of D. 


5 Brauer group and cohomology 


5.1 The Brauer group is a second cohomology group 


Definition 26.5.1: Let Brz/x,, denote the subset of Brz/x consisting of [A] where A@« L = 
M,(L). Note that Brpj~ = Unen Bry/K,n- 


Theorem 26.5.2 (Cohomological interpretation of Brauer group): thm:brauer-cohom There are 
canonical bijections 


On Brr/Kn = H'(G, PGL,(K)) 


and canonical isomorphisms 


6: Brisk => HL KR) 
5: Bre > H2(K) 


where H?(K) := H(K/K) HL): 


a ima ig finite Galois 
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Proof. We can represent elements of Brz/x, as algebras of dimension n? over K, that are 
L-isomorphic to the algebra M,,(L). By Example ]26.3.2| we can encode the algebra M,,(L) 
by a tensor of type (1,2). By Theorem |26.3.4} 


eq:brauer-descent Broien = H\(G, Aut (M,(Z))). (26.10) 


By the Noether-Skolem Theorem|26.4.7| every automorphism of M,,(L) is conjugation by an 
element of GL, (A). Since the matrices that act trivially by conjugation are just the scalar 
matrices, we have the short exact sequence 


einsekebsiaian |. a i. — GL, (LZ) —> Aut(M,,(L)) = PGL,,(L) —> 1. (26.11) 
Along with (26.10) this proves the first part. 


The long exact sequence |25]25.15.3] of (26.11) gives 
0=H(G,GL,(D)) > W(G,PGL, (1) = WG, L*), 
where the LHS follows from Theorem |26.3.1} Let 6, = A, ° @,; then 6, is an injective map. 

We show the following. 
1. The 0, for different n combine compatibly into an injective group homomorphism 

0: Br(L/K) + H?(L/K): We need to show 

Onn (A @ A’) = bn(A) bn (A’) 
for any A € Bry» and A’ € BryK wy. 


First, note that if a,a’ are tensors encoding the algebras A, A’ on V ® V*®? and V’ @ 
v"® then « @ 2’ encodes the algebra A @ A’ on (V @ V’) @ (V @V’)*®?. Let x, x 
encode M,(/) and M,,(K), so that x @ x encodes M,,(K). If f : « > a and 
f’: a’ >a’ are L-linear maps, then we have the L-linear map on M,,,(L), 


f®f':2@2' >a@d. 


Now 0,0, map A and A’ toc, = f~1o f? andcL = f'| o f’”. Suppose c, and Cc, 
are represented by conjugation by S, and S$”, respectively. Now 0,n, maps A® A’ onto 
do = (f @ f’)-'o(f @ f’)’, which corresponds to conjugation by S, @ S$”. Then by the 
description of A in Theorem |25]25.15.2| we see that 


bmnt(A @ A’) = fagr = ind [(S, @ S!)o(S, @ S!)(Sor @ St.) !]} = 5n(A)bn(A’) 


where inn is the inclusion map L* — GLyny(L). Under the inverse of inn: = in ® in’, 
tensor product becomes simply the product. 


2. 6 is surjective. It suffices to show A,, is surjective, where n = [L : K].° Take an 
2-cocycle a,,, € H?(G, L*). We need to show that 


iz.2=8,0(8)5.° 


*Incidentally, this shows that every equivalence class of algebras is represented by one of dimension at 
most [L : K]?. This is consistent with results of the previous section. 
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for some values of S, € GL,(L). We identify L” with the group algebra L[G], and let 
So € GL(L[G]) be the map sending 7 to a,,,o7 (it is invertible as a, € L*). Then 
we calculate for every ue GC L|GI, 


eC (oe) t= (Oe 0 Ory) |OTU 


epee S(O OF: 
The right-hand sides are equal since a,,, is a cocycle. Hence 
Agr = Seo(S;) S57 
is in the image of A,. 


3. 6 gives an isomorphism Brx & H?(K): This follows from Theorem |26.4.14} the follow- 
ing easy-to-check commutative diagram (which holds for any K CLC M), 


H?(L/K)—5S H?(M/K) 
| | 
Bryjx$ > BruK, 


and taking the direct limit of the maps Bry;x% — H?(L/K). 


Remark 26.5.3: Milne |Mil08| makes this correspondence more explicit. The relationship 
between the two approaches can be seen by choosing a basis for the tensor product V @ V*®?; 
the coefficients are called the structure constants of the algebra. (We followed Serre; note 
that the isomorphism in Serre is the opposite of the isomorphism in Milne.) 


5.2 Exact sequence of Brauer groups 
The importance of the Brauer group in class field theory is given by the following proposition. 
Theorem 26.5.4: brauer2 Let M/L/K be Galois extensions. Then there is an exact sequence 


0 —+ H?(L/K) —> H?(M/K) —> H?(M/L) 


Bry/K BruyK Bru/t . 


For any Galois extension L/K there is an exact sequence 


0 —+ H*(L/K) —> H?(K) ——+ H?(L) 


Bry/K Brr Bry. 
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Proof. Since H'(L/K) = 0 by Hilbert’s Theorem 90 (26.1.1), the inflation-restriction exact 
sequence |25]25.11.10| with G = G(M/K) and H = G(M/L) gives 


03 H?(L/K) @s H?(M/K) 2% H?(M/L). 


The equality with the Brauer groups follows from Theorem |26.5.2 
Taking the direct limit over all finite Galois extensions M/K gives the second result. 


6 Problems 


2.1 (Artin-Schreier) Let L/K be a Galois extension of degree p, with K/F, a finite exten- 
sion. Prove that L = K(a) for some a such that a? —a € K. (Hint: Consider a short 
exact sequence as in the proof of Kummer theory. However, use the map x +> x? — x 
instead of x ++ x?, and consider additive instead of multiplicative groups.) 
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Chapter 27 


Local class field theory 


ictt We now prove the main theorems of class field theory using cohomology. Throughout this 
chapter, K, L, etc. will denote nonarchimedean local fields, unless specified otherwise.! The 
main steps are the following. 


1. Construct the invariant map H?(K™/K) — Q/Z. (Proposition |27.2.1) 


(a) Show that H?(G(K™/K), Ux) = 0. (Theorem |27.1.1) 


(b) From the decomposition AY* = Ugur xZ and step 1, we get H?(G(K™/K), K™*) = 
H?(G(K"/K),Z). (Note the projection K™* — Z is the valuation map vu.) 
Relate H?(G(K™/K),Z) to Q/Z using the long exact sequence in cohomology 
associated to 0 —- Z>+ Q> Q/Z > 0. 


2. Now show that there is an isomorphism Bry := H?(K/K) = H?(K™/K) (Theo- 
rem |27.3.1). Thus we can restrict attention to unramified extensions of K and use step 
1. Unramified extensions are easier to deal with! There are two approaches: 


(a) By Theorem |26]26.5.4| there is an exact sequence 


0 H?(K"/K) > Brx > Bru. 


Show that Brxu = 0 by considering central simple algebras over local fields. 


(b) Study the cohomology of Uz when L/K is cyclic to conclude that the Herbrand 
quotient h(U,) is 1. From this get h(L*) = [LZ : K]. From this calculation and 
Hilbert’s Theorem 90 ((26]26.1.1), compute? 


IH"(L/K)| = 1, 
IH?(L/K)| = [L: Ki. 


Conclude that H?(L/K) is cyclic of order [LZ : K] and hence included in H?7(K™/K), 
for any finite L/K. 


Local class field theory for R and C is trivial and left to the reader. (The only nontrivial field extension 
is C/R.) 
?This is the input for abstract class field theory according to Neukirch [Neu99]. 
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3. Combining the first two steps, we get the invariant map invx : Brx > Q/Z. Show that 
this is compatible with restriction and hence that (G(K/K), K) is a class formation. 
Note invx restricts to H?(L/K) > TE: RZ supposing its image is generated by upK, 
Tate’s Theorem [25]25.13.1] gives an isomorphism 


H7z?(G(L/K),Z) Z) —“"*, H9(G, L*) 


G(L/K)» K*/Nmz/x(L*) 


that sends Frob;;« to [t] when L/K is unramified. Taking a direct limit, we get a 
map K* + G(K?>/K). Note we only get a map from G*» (norm limitation). 


4. Study the Hilbert symbol to prove the existence theorem (See Sections |6}/7). 


Unfortunately it is quite difficult to trace through the maps to find out what the Artin map 
actually is—for this Lubin-Tate Theory is better. 


1 Cohomology of the units 
For an unramified extension, the cohomology of the units is trivial. 


Theorem 27.1.1 (Cohomology of units): cohomotogy-unitstriviaa Suppose L/K is a finite unram- 
ified extension of local fields with Galois group G. Let Uz, be the group of units of L. 
Then 

AA(G,U;) =1 


for any r. Hence H"(G(K™/K), Ugur) = 0 for n > 0. 


Proof. We will show that 


Then it follows from Proposition [25]25.12.1] that all the Tate groups are trivial. The second 
part follows from taking the direct limit. 
We have 
eq:i=uix2zL* = Uy, x m4 2U, x Z (27.1) 


where 7 is a uniformizer for L. Since L/K is unramified, we can choose 7 € K. Then G 
acts trivially on 7, so acts trivially on Z in the decomposition above. Thus (27.1) gives a 
decomposition of L* as a G-module (not just as a group). We have by Hilbert’s Theorem 90 
(Theorem |26]26.1.1) and the fact that cohomology respects products (Proposition |25]25.6.7) 
that 

(2 2°(G, i) =H (G,U;) xf (G,Z). 


Hence H'(G, Uz) = 1. 
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It remains to show H9(G,U,) = 1. To do this, let m be the maximal ideal of L, U(” := 
1+m”, and consider the filtration 


ue = UK > u > U2) yore 
Proposition |27.1.2] and |27.1.3] below show that each quotient has trivial cohomology: 


Aye 0 a1. 


Then Lemma|27.1.4| gives that H?2(G,U,) = 1, as needed. 


Proposition 27.1.2: unitssittration2 Let K be a complete field with discrete valuation, m be 
the associated maximal ideal, and U. ie :=1+m"”. Then we have isomorphisms 


UE/0y ae YU 
ur>u_ (mod m) l+an™+'>a (mod m) 


that preserve Galois action. 


Proof. This is Proposition |22]22.4.8 


Proposition 27.1.3: cohom-fnite-tcias Let 1/k be an extension of finite fields and G := G(I/k). 
Then 


Hy(Gl*) = {1} 
H7(G,I") = {0} 


for all r € Z. Moreover, the maps Nmj/, : 1 — k and try, : 1 — k are surjective. 


Proof. By Hilbert’s Theorem 90 (26]26.1.1), H'(G,1*) = 0. Since G is cyclic and | is finite, 


by Proposition |25]25.12.4| h(/*) = 1, giving H?(G,I*) = 0. Again since G is cyclic, by 
Theorem |25]25.12.1| all the Tate groups are 0. 


From Theorem |26]26.1.2} Hi(G,I*) =0 for r > 0. 


For the second statement, just note 


{1} = Hp(G,l*) = ()9/Ne@(I*) = k* / Nm, (1”) 
{0} = Ap(G,It) =19/No()) = k/trys (I). 


Lemma 27.1.4: sitrationon Let G be a finite group and M be a G-module. Let 
M=M°DM!)D..- 


be a decreasing sequence of G-submodules and suppose M = lim M/M? (i.e. M is complete 
with respect to this filtration). If H4(G, M'/M*‘t!) = 0 for all i, then H4(G, M) = 0. 
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Proof. Let f be a q-cocycle of M. Since H4(G,M/M') = 0, the long exact sequence of 
0+ M'—+ M > M/M' gives H4(G, M') - H4(G, M) and we can write f = go+ f1, where 
go = Oho is a coboundary in M and f; is a q-cocycle in M'. Given f, € H4(G, M"), we can 
write 


Tri = On ai Fact 
where h, is a (q¢ — 1)-cocycle of M” and fy+41 is a g-cocycle of M"*t'. Then 


f =6(hy tho +++), 


the infinite series being defined in H4~!(G, M) since h,, is a cochain with values in M™, and 
M is complete with respect to this filtration. 


This proves Theorem [27.1.1] We record the following corollary, for easy reference. 
Corollary 27.1.5: tnmccainm-sui Suppose L/K is a finite extension of local fields. Then 
Ux CNmyz/x U_. 
Proof. If L/K is Galois, then this follows since by Theorem [27.1.1] 
Ux/Nmpz/x Ur = Hp(G(L/K), Uz) = {1} 


so the norm map Uz, — Ux is surjective. 
For general extensions L/K, consider the Galois closure and use transitivity of norms. 


2 The invariant map 


2.1 Defining the invariant maps 


Proposition 27.2.1: invariant-map For any finite unramified Galois extension of local fields L/k 
there is a canonical isomorphism 

1 
[L : K] 


invy/x : H?(L/K) > Z/Z. 


Taking the direct limit gives an injective map 
invxe/_ : H?(K™/K) > Q/Z. 
Proof. Consider the short exact sequence 
13 U, 3 L* 4+Z-0. 


Since Hi\(G, U,) = 0 for all n by Theorem |27.1.1| taking the long exact sequence gives 


IPG). * H?(L/K) > H?(G,Z) + H(G;t). ° 
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We relate H?(G,Z) to a lower cohomology group by considering the short exact sequence 


0-Z>Q-Q/Z- 0. 


Note H"(G, Q) is torsion for any n > 0 by Corollary |25/25.11.6| Since Q is a divisible group, 
so is H"(G, Q), by looking at the description of H” in terms of cocycles (Section|25]7). Hence 


H"(G, Q) = 0 for any n > 0. Taking the long exact sequence of the above we get 


HUGO) H'(G,Q/Z) % H°(G,2) + HAG 


Thus we get a map 


0 


. 2 S772 = pl ae 1 
eq:inv-unram INVE/K 7 A (L/K) —> A (G, Z) — A (G, Q/Z) = Hom(G, Q/Z) — [L : K] Z/Z. 
(27.2) 


where the last is defined by taking the Frobenius element o of G and mapping f +> f(c). 
(Note G is cyclic and o generates G; the Frobenius is a canonical choice.) 
Now define invgu/sK = lim, ee ae ee invy/x«, taking the direct limit under 


inflation. Since inflation is functorial, the first two maps in commute with it. Identi- 
fying H'(G,Q/Z) ~ Hom(G, Q/Z), inflation sends a map G(L/K) > Q/Z to G(M/K) > 
G(L/K) — Q/Z. Moreover, Froby/« is the projection of Froby/x% to G(L/K). Hence Inf yy/p 
commutes with the inclusion map rReZ /Z—4 mKZ /Z, and the inv;;« form a compatible 
system under inflation. 


Remark 27.2.2: Let K be any nonarchimedean complete field (not necessarily local) with 
residue field k. Then 


A"(L/K) = H"(1/k) x H"(G(L/K), Q/Z). 
Indeed, Proposition [27.1.2] and Theorem [26]26.1.2] still give 
Hy(G,Up/Up*) = H7(G,I*) =0 


fori > 1. This gives H7(G, uy) = 0 by Lemma ]27.1.4| From the long exact sequence 
associated to 
ieUP su. SU ers 


we get 
H"(L/K) 2 H"(G, Uz) x H"(G,Z) = H"(G,I*) x H"(G,Z). 


In the case of a local field, / was finite so H"(G,1*) = 1. 


2.2 Compatibility of the invariant maps 


We show that the invariant maps are compatible, in the following sense. 
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Theorem 27.2.3: tnm:inv-compatibie Let L/K be a Galois extension of local fields, and n = [L: 
K]. Then 


Inv «ur/p 0 Resx/p = NINV Kur /K 


Proof. To do this we have to unravel all those steps we took to define invqu/,... We first 
prove this for two special cases. 


1. L/K is unramified. Let G = G(kK™/K) and S = G(k™/L). We claim the following 


commutes. 


H?(K™" /K) —> H?(G,Z) —— H1(G,Q/Z) — Q/Z 
[es ke [es if 
H?(k™ /L) ——> H?(S,Z)—— H(S, Q/Z) > Q/Z. 


For the squares involving Res, this follows from naturality of Res. For the last square, 
identify H'(G,Q/Z) = Hom(G,Q/Z); Res becomes simply restriction of homomor- 
phisms. Recall that y was defined taking the Frobenius Frob(k™/Kk) € G(k™/K) 
and sending f € H'(G, Q/Z) = Hom(G, Q/Z) to f(a), and we have 


Frobyeur => Frob jour /z, 


by Proposition |24]24.1.4 
2. L/K is totally ramified. Note that G = G(K™/K) = G(K™L/L) = G(L"/L) in 
this case, from the description of K“" in Theorem |21/21.2.6| We show the following 
commutes: 
H?(K™ /K) —> H?(G,Z) —— H\(G,Q/Z) ++ Q/Z 
ee ae 
H?(K™ /L) —~+ H*(G,Z)— H\(G,Q/Z)*— Q/Z. 


The first square commutes by commutativity of 


UK 
kux Z, 


_f 


pe* —* 7, 


(and of course, naturality of cohomology). Here vx and vz are the valuation maps, i.e. 
the projections K* ~Ux x Z—> Zand LX =U, x ZZ. 


The general case follows by considering L/L'“/* (totally ramified) and L’“/* /K (unramified). 


(See Theorem |1515.7.2}) 
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3 H?(K/K) = H?(K™/K) 

We prove the following. 

Theorem 27.3.1: nx: The inclusion (inflation) map 
H?(K™/K) > H?(K/K) 

is an isomorphism. 


For short we write H?(K) := H?(K/K). 


3.1 First proof (Brauer group) 
First proof. By Proposition |26]26.5.4| there is an exact sequence 


0 H?(K™/K) > H2(K) > H?(K™) =Begar® 


The last term is zero by Theorem |27.3.2| below. Thus we get H?(K"™/K) = H?(K), as 
needed. 


Theorem 27.3.2: treuxo Let K be a local field. Then Brew = 0. 


We need two lemmas. 


Lemma. 27.3.3: tcmbrkuw0-1 Suppose D is a central division algebra of rank n? > 1 over a 


field K, and the residue field & is perfect. Then there exists a commutative subfield L of D 
properly containing AK’, unramified over Kk. 


Lemma 27.3.4: tem:briuro-2 Keep the same hypotheses as Lemma }27.3.3} There is a subfield 
of D of degree n unramified over K. 


Note this is a maximal subfield by Corollary |26.4.13 


Proof of Lemma[27.3.5, Suppose by way of contradiction that every commutative subfield 
L of D properly containing K is ramified. Then the extension of residue fields //k must be 
trivial (see ston BOLE ETSI 8 Let a € D be integral and a € D be a uniformizer for D. 
(See Proposition |26]26.4.15]) Since 1 = k, there exists b € K such that b =a (mod 7), and 
we can write a = b+ 7b, for some b; € @p, where @p is the ring of integers in D. Iterating 


this with b;, we find 
a=b+abj+---+271b,_-1 +0 bn 


where b),...,b,-1 € @x and b, € @p. Thus a is in the closure of K (7). But K (7) is closed 
(it is a finite-dimensional vector space over kK), soa € K(m), ie. D = K(z) and D is 
commutative, a contradiction. 


335 


Number Theory, 827.3 


Proof of Lemma[27.3.4, Induct on n. The case n = 1 is clear. Let n > 2. By Lemma/27.3.3} 
there exists a proper unramified extension K'/K inside D. Let D’ = C(K’). Since D’ C D, 
D' must be a division algebra (a finite dimensional integral domain must contain inverses). 
Let its center be Kk”. The maximal commutative subfield of D’ then has dimension \/[D’ : Kk" 
over K”, or dimension ,/[D’: K"|[K" : K] = \/[D’: K][K”: K] over K. This is at most n, 
since the field is also contained in D. But \/[D’: K][K’: K] =n by the double centralizer 
theorem so we must have K” = K. Thus D’ is a division algebra with center 
K'. Its degree over K’' is less than n?, so by the induction hypothesis, D’ has a maximal 
commutative subfield L containing A’, of degree ,/[D’: K’|, and unramified over kK’, hence 
over kK. We calculate 


[Oo KR) = (be RK’ 2) f(D RK: K) 4) (D! KKK) 2 4/[(D < K) 
where we used Theorem |26]26.4.11}in the last step. This finishes the induction step. 


Proof of Theorem Suppose D is a central division algebra over K™ of rank n?. Then 
lemma 2 furnishes a subfield of kK" of degree n, unramified over kK". Hence n = 1, and D 
is trivial. Thus Brxu = 0. This proves Theorem |27.3.2|and hence Theorem |27.3.1 


3.2 Second proof (Herbrand quotient calculation) 


3.2.1 Herbrand quotient calculation 


We first need the following lemma. 


Lemma 27.3.5: subgroup-triviat-cohom Given a local field L, there exists an open subgroup V of 
Uy with trivial cohomology, i.e. H4(G,V) = 0 for all q. 


Proof. The idea is to compare a multiplicative G-module V with an additive G-module (more 
accurately, compare the filtration of V), and use the same argument as in Theorem|1[26.1.2|° 
By the normal basis theorem, L* has a normal basis {o(a) : 0 € G}, i.e. it is free over 
K[G]. Let A = Seg G@xo(a).* By multiplying a by a power of 7x we may assume that 
a € @,. Suppose that 
TROL C A C Cy. 
Let M = ntt'A, V=14+M and V =1+7%M. Note that 


M-MC fae -AC TEN EOL e Te A CrKM. 


3If char(L) = 0 there is a faster proof: Note that e” is a topological isomorphism from a neighborhood of 
0 in the additive group L to a neighborhood of 1 in the multiplicative group @,. Moreover, it preserves the 
action of G because the fact that G acts continuously on L gives 


eet = S- ee) = S> ate ) = ce". 


Now Theorem |1]26.1.2| applies directly. 
4Warning: A is a Ox |[G]-module; we don’t know it is an @,-module. 
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This shows that 
1. V is a subgroup: Indeed, (1+ M)(1+ M)C14+M+M-MC14+4M by the above. 


2. V?/Vit! = A/rKA as G-modules. Indeed, if m1,m2 € M, then for some m3 € M, we 
have 


(4+-mjmy) (1+ ptm) = lta (mi tig) +aKTKMs = tT, (m+m2) (mod mf"). 


Hence 
HYG,V/V@)) = H4(G, M/tKM) =0 


for each q, since M/axM is an induced module over G (and has trivial cohomology by 
Shapiro’s Lemma |25]8.1). (By construction M/meM = Ind@[(attla@x)/(nr?a@x)].) 
Lemma .|27.1.4] applied to V finishes the proof. 


Proposition 27.3.6: herbrand-umits1 Suppose L/K is cyclic of degree n. Then 


Proof. Choose V as in Lemma |27.3.5| Since V is open, U,/V is finite. By Proposi- 
tion |25]25.12.4(1), h(U,/V) = 1. Hence 


h(Uz) = h(V)A(Uz/V) = 1. 
By Proposition |25]25.12.4(3), h(Z) = |G] =n. Since LX = U; x 77 we get 


h(L*) = h(Uz)h(Z) =n. 


Theorem 27.3.7 (Class field axiom for local class field theory): thm:cfa-tce Let L/K be a cyclic 
extension of degree n. Then 


JH'(L/K)| =1 
LAG K )| =: 


Proof. The first follows directly from Hilbert’s Theorem 90 (26.1.1). For the second, we have 
|H?(L/K)| = h(L*)|H*(L/K)| =n using Proposition |27.3.6 


We want to show that |H?(ZL/K)| =n for all Galois extensions L/K, and in fact H?(L/K) 
is cyclic of order n. We proceed in 2 steps. 
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3.2.2 First inequality 


We show that for all Galois extensions L/K, |H?(L/K)| > [LZ : K]. In fact, we show the 
following. 


Lemma 27.3.8: temccat-ieq Let L/K be a Galois extension of local fields of degree n. Then 
H?(L/K) contains a subgroup canonically isomorphic to 4Z/Z. 


Proof. We prove this using Theorem [27.2.3] which relates the invariant maps on K“"/K and 
L™/L. By Theorem [26]26.5.4] we have the exact sequence 0 > H?(L/K) > H?(K) > 
H?(L). Inflation and restriction commute by functoriality of change of group, so we have 
the commutative diagram with exact columns 


eq:local-leq 0 0 (27.3) 


H?(L/K)< ker(Res) 


H2(K) > H?(K™/K) 


Res Res 
H?(L) AM >H?(K"/L). 


By Theorem [27.2.3] the map H?(K“/K) — H?(L™/L) corresponds to the multiplication 
by-[Z : K] map after identifying both sides with a subgroup of Q/Z through the respective 
invariant maps. Hence ker(Res) = +Z/Z. The top map exists and is an injection because 
the other two are (4-lemma). Hence +Z/Z > H?(L/K), as needed. 


3.2.3 Second inequality 
Next we show |H?(L/K)| < [L: K], so |H?(L/K)| =[L: K]. 


Lemma 27.3.9: tem:tocai-2ineq Let L/K be a Galois extension of local fields of degree n. Then 
H?(L/K) = +Z/Z. 


Proof. We already know that |H?(L/K)| = [L : K] for L/K cyclic (Theorem [27.3.7). We 
prove that |H?(L/K)| = [L : K] by induction on the degree. 

By Corollary G(L/K) is solvable. Thus, if G(L/K) is not cyclic, it has a 
normal subgroup G(L/K"‘). By Theorem [26]26.5.4] we have an exact sequence 


0 H?(K'/K) > H?(L/K) + H?(L/K’) 


|HP(L/K)| < |B?(K'/K)|- |AP(L/K')| = [K's KIL: K'] = [L: Ky. 


By Lemma }]27.3.8} equality holds. 
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3.2.4 Finishing the proof 


Second proof of Theorem|27.3.1| Take any element a € H?(K/K); it is in H?(L/K) for 
some finite Galois L/K. The top injection in (27.3) is an isomorphism by Lemma [27.3.9 
and we get a € H?(K™/K). 


4 Class formations 


sec:class-formations The preceding sections show that 
(G(K/K),{G(L/K) : L/K finite Galois} , K) 


is a class formation. That is, it satisfies the basic axioms that allow us to obtain the 
conclusions of class field theory. With the abstraction of class formations, when we prove 
global class field theory, we only have to verify the axioms and we will get the desired 
conclusions in the same way as in local class field theory. 


4.1 Class formations in the abstract 


Definition 27.4.1: An abstract Galois group is a group G with a family of subgroups 
of finite index {Gz }rex such that 


1. (Closure under intersection) If L1, 22 € X, then there exists M such that 


Gig NG, = Gu. 


2. (Closure under superset) If G; C G’ C G are subgroups, then G’ = Gx for some K’. 
3. (Closure under conjugation) For every s € G and L € X there exists L’ so that 


sGis = Gr. 
This definition is motivated by the fact that these are the key properties of Galois groups. 


Proposition 27.4.2: A topological Galois group G(Q/Ko) with all its closed subgroups, is 
an abstract Galois group. 


Proof. By the fundamental theorem of infinite Galois theory |12]12.8.4| the closed subgroups 
of G(Q/Ko) are exactly those in the form G(Q/K) with Kg C K CQ. The above properties 
correspond to the following facts from Galois theory. 


1. G(Q/K) A G(Q/L) = G(Q/KL). 


2. The subgroups of G(Q/Ko) containing G(Q/L) correspond to intermediate extensions 
between Ko and L. 
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3. sG(Q/K)s~' = G(Q/sK). 
We transfer some terminology about Galois groups to the abstract case. 


Definition 27.4.3: Let (G,{Gz}zex) be an abstract Galois group. The elements of X are 
called fields. The field Ko with Gx, = G is called the basefield. For Gy C G_, define [M : L] 
to be |G, : Gy]; we say M/L is a Galois extension if Gy, < G_;, and write 


G(M/L) = G,/Gu 


(called the “Galois group” of W/L). We say M/L is abelian, etc. if GUM/L) is abelian, etc. 
The field M such that Gz, Gz, = Gy is called the composite of LZ, and Ly, and denoted 
by L,Lz; the field L’ such that sGys~! = Gy, is denoted by sL. 


Note every extension M/Z is contained in a Galois extension: Since [G', : Gy] is finite 
Gy has finitely many conjugates sG,,;s~' in Gz; by the axioms Gyy =), sGs~! for some 
M', called the Galois closure of M/L. 


Definition 27.4.4: A formation is a triple (G,{Gx}kex, A) where (G, {Gx} ex) is an 
abstract Galois group and A is a discrete topological G-module (see Definition |25]25.14.1). 
Let AK — ACK, 

Define the norm Nmy/x% : AL + Ax by letting Nmz/«(@) = Teeqquyeui/K) 7 (@) for 
any L’ Galois over K. 


For L/K Galois, we define H"(L/K) := H"(G(L/K),A,). We can define inflation, 
restriction, and corestriction maps in the natural way, with Resx,;, = Resg¢,/g,, and so 
forth. 


Definition 27.4.5: actciassformation A Class formation is a formation (G,{Gx}Kex, A) with 
a homomorphism invz/« : H?(L/K) + Q/Z for each Galois extension L/K, such that the 
following hold. 
1. H'(L/K) =0 for every cyclic extension of prime degree. 
1 


2. invz/« is an isomorphism from H?(L/K) to we Z/Z- 


3. (Compatibility under inflation) For any finite extension M/L, 
INV M/K ie) Inf w/t = InNVE/K : 


Hence we can define inv : lim, 2(L/K) > Q/Z. (This axiom implies that inflations 
are injective on H?, so we can think of H(K) := lim H?(L/K) as U, H?(L/K).) 


4. (Compatibility with restriction) For any finite Galois extension L/K, 


inv, o Resx;, = [L: K]invx . 
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Define the fundamental unit of L/K to be 


sedi i 
UL/K = IDV» [L: K] : 


Proposition 27.4.6: Assume a formation satisfies axiom 1. Then for every Galois extension 
L/K, 
AY L/K)=0. 


Proof. First we show this when [L : K] is a prime power p”. Induct on the degree. The base 
case is given. Every p-group has a subgroup of index p, so there is K C K’ C L such that 


G(K’'/K) has order p. By the inflation-restriction exact sequence |25}25.11.10] we get 


0 
0 eR MCRD) © MEER 


the first and last terms are 0 by axiom 1 and by the induction hypothesis. So H'(K/L) = 0. 
For general L/K, this shows H'(G(L/K),, A,) = 0, so the result follows from Corol- 
lary |25}25.11.8 


Proposition 27.4.7: Assume a formation satisfies axiom 2. Transferring the action of Res, 
Cor, and Inf to the subgroups of Q/Z, we get the following diagram: 


M 
[M:L] 

fe H?(M/L) “55 —> aZ/Z i 
[L:K] conn ( pews ( Jen | 

K H?(M/K) > apgZ/Z a oe: *(L/K) mR Z/Z 


Inf wry, i 
(Note Cory; o Resx/, = [LZ : K].) Moreover (passing to the limit), the following hold. 
1. For every extension L/K, 
Resx/p : H?(K) — H?(L) 
is surjective. 
2. For every extension L/K, 
Cory/x : H?(L) > H?(K) 


is injective, and 
inv x © Corz/K = invy. 
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3. For every s € G, letting s* : H?(K) > H?(sk), 


inv,. OS’ =invgz. 


Proof. The surjectivity of Resx,/,; in the diagram comes directly from the injectivity of inv, 
and invy o Resx/, = [L: K]invx. 
For the action of Corz/%, note 


inv ~ 0 Corz/K 0 Resx/r, = invx o[L : K] = invy o Resx/z, 


where the first follows from Theorem and the second from the axiom. Surjectivity 
of Resx;z gives invx © Cory/~% = invz, as needed. 

Items 1 and 2 now follow from taking the direct limit. 

For 3, let the basefield be Ko; note the map s* : H?(Ko) + H?(sKo) = H?(Ko) is the 
identity by Proposition sO iNVsK, 0S* = invg,. For arbitrary x € H?(K), by 
surjectivity of Resx;, we can write x = Resx, L(Zo). Since Res and s* commute (transport 
of structure), 


invsx«(s*z) = invsx(s* Resx,/K Zo) = invsK ReSsx/sKk,(8"20) = [SK : sKo] invsx,(%o) = invx(z). 


The reciprocity law follows from the properties of class formations. 
Theorem 27.4.8 (Abstract reciprocity law): tmabstract-reciprocity Let (G, {GK } ex, {Ax }, invi/K) 
be a class formation. Then there is a isomorphism 


pe 


Hy *(G(L/K), Z) —-—> H7(G, Az) 


G(L/K)?® Ax/Nmz;/x(Az) 


Here Nmyz/x% means Ng,/g,. Denote the reverse map by ¢1/xK. 


Proof. The identifications are from Theorem |25]25.8.3] and Definition |25]25.9.2} Axioms 1 
and 2 for class formation give that the two conditions of Tate’s Theorem |25]25.13.1] are 
satisfied. 


This map is hard to calculate directly because cup products on negative Tate cohomology 
are hard to deal with. The following helps us by transferring the cup products to nonnegative 
Tate groups. 


Theorem 27.4.9: thm:catcuiate-tocatartin Keep the above hypothesis. Then for any y € Hom®"(G(L/K), Q/Z) = 
H'(G,Q/Z) anda € Ax, 

x(r/K(@)) = inv (ZU 6x). 
Here @ denotes the image of a in H9(G(L/K), Ar) = Ar/Nmz;x Ar, and 6 is the diagonal 
morphism corresponding to the exact sequence 0 > Z > Q > Q/Z > 0. 
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Note this characterizes the reciprocity map since knowing the image of an element of an 
abelian group under all homomorphisms to Q/Z is equivalent to knowing the element itself.° 


Proof. Suppose x(¢r/x(a)) = 2. 
By the definition of the Artin map as the inverse of uz/x« Ue, we have 


@ = upsK U o1/xK(a). 


We now calculate the following (for easy reference, we note which cohomology groups the 
elements are in). 


eo OX = [ursK U }z/K(a)] U Ox 
SS Sa SS “MY 
0 2 9 —2 2 
= ur/K UlézsK (a) U 4x] associativity 
ee 
= uz/K Ulb(br/Kx(a)U x )I Theorem |25]25.10.1(4) 
<a a 
= ut/K US(x(¢r/K(a))) Theorem |25]25.10.3(3) 
Ree ace Ne ey 
2 0 
r 
_ UL/K Ud (-) 
2 0 
eq:calc-artinl = U U : i (27.4) 
or 
= TUuL/K Theorem |25125.10.3(1) 
; - r 
inv«(@U dx) = — = x(¢z/x(a))- 


In (27.4), we use the map in the snake lemma to calculate 6 (2): it pulls back to > EQS 
H7'(G,Q); the norm maps it tor = n-* € Q = H9(G,Q) D HY(G,Z). In the second- 
to-last line, we note that e Ur is simply multiplication by r in dimension 0, so Theorem 


25.10.3{1) tells us it is multiplication by r in dimension 2 as well. 


We need several naturality properties of the reciprocity map. 


Theorem 27.4.10: thm:reciprocity-natural Let M/L/K be Galois extensions. The following are 


5It may seem odd to calculate yo é, /« instead of dr/x« directly but keep in mind that for general L/K, 
Frobz«(p) is only defined to be a conjugacy class, and it is natural to look at the action of characters on 
conjugacy classes because characters are class functions. 
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commutative. 
ge A ae 
i | lee 
G(M/L)* 2 GUM /K)* G(M/K)8=—'G(M/L) 
Ag ———— Ag Ax 


OL/K 
OL/K OsL/sK OM/K 


G(L/K)* + G(sL/sK) G(M/K)*» —+G(L/K)*® 


Proof. First note that the maps in the first diagram are corestrictions and the maps in the 
second diagram (on the right) are restrictions by Proposition |25.11.4 
From axiom 4 of Proposition |27.4.5} we have 


Resx/i(um/K) = UM/L- 


We will use Proposition |25]25.11.9| about the commutativity of cup products with re- 
striction and corestriction. The first diagram follows from 


Cort x(x Uumyt) = Cort x(x U Resx/r(um/K)) = Cory x(x) Uum/k, «€G(M/L)*” 
The second diagram follows from 
Resk/p(# Uum/K) = Resxj,(2) UUm/L- 


The third diagram follows from the fact that the map s* : AL + Asx takes uz/K tO Usz/sK- 

For the last diagram, let y be a character on G(L/K), which gives a character ywjK on 
G(M/K) using the projection G(M/K) > G(L/K). By Theorem |27.4.9] we have, for any 
character x, 


xu/K(¢m/K(@)) = inve(GuyjK U 6xXM/K) = invK(GrKx U 6x) = x(¢r/K(@)) 


where @y/K,Gz/« are the images in H?.(M/K) and H}(L/K), respectively. 


The fourth diagram means that the maps @z/% are compatible, so we can define 


ox = lim on): A> @. 
L 


(Note A= A#.) 


Theorem 27.4.11 (Norm limitation): sorm-timitation Let (G,{Gx}, {Ax}, invrsK) be a class 
formation. Let L/K be an extension and E'/K be the largest abelian subextension. Then 


Nmy/K Ar => Nmg/K Ap. 
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Proof. Let L8*' be the Galois closure of L. Transitivity of norms (just look at the definition 
of norm...) gives us C. Conversely, suppose a € Nmgyx Ap. Let G = G(L®'/K) and 
H = G(L'/L). Since E is the largest abelian subextension of L& abelian over K and 
contained in L, the subgroup of G fixing it is G’H. We have the commutative diagram 


bpeal sp 


A, ——> H/H' 


Nmyp/K i 


brgal/K 


Ax —>G/G' 


we 


G/G'H 


where 7 is induced by inclusion. Because a € Nmg/K Ag, ¢g/K(a) = 1 in G/G'H. Thus 
brea/K(a) € G'H/G’, and @yea/q(a) is in the image of 7 and hence 70 (z//,, and there exists 
b€ Az such that dzsasx(a) = i(bzsa/p(b)). Then 


Preat/K (a) = 1(Preatz(b)) = brsayx(Nmz/«(0)). 


This means € ker(@zea/x) = Nmyeasx (Az); say it equals Nmyea/%(c). Then 


ees = 
Nmz/x (6) 


a= Nmyz/«(b Nmzeaiz(c)) E Nmz/x(Az), 


as needed. 


Definition 27.4.12: A subgroup S of Ax is a norm group if there exists an extension 
L/K such that S = Nmz/x (Ar). 


Theorem 27.4.13 (Bijective correspondence): tim:abstract-bijection Let (G, {Gx}, {Ax}, invz/xK) 
be a class formation. Then there is a bijective correspondence between finite abelian exten- 
sions of K and the set of norm groups of Ax, given by 


De Nmz/x«(Az). 


Furthermore, this is an inclusion-reserving bijection that takes intersections to products and 
products to intersections: 


LOM => Nmz;x(Az) 2 Nmyyx(Am) 
Nmyz.p//K(Az.1/) = Nmyz/« (Az) M Nmy/x«(Ay’) 
NmyparyK (Atay) = Nmzx« (Az): Nmyp/x(Az). 


Finally, every subgroup of Ax containing a norm group is a norm group. 


Proof. Abbreviate Nmz/x(Az) by Nz. 
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First we show Nyy = N,M Nz. By reciprocity, 


Ni A Ny = ker(¢p/K) N ker(¢r/K) 2 ker(dru/K) = Nuv 


where (*) comes from compatibility of the @ and the fact that the map G(LL’'/Kk) — 
G(L/K) x G(L'/K) is injective. 

If L C M, then Nz > Ny from transitivity of norms. Conversely, if Nz > Ny, then 
by the above Np = NzNu = Nim. Thus [Ag : Nz] = [Ax : Nrw}, and reciprocity gives 
[L: Kk] =|LM: K],ie. LM =L,ie. LCM. Thus, L + Ny; is an inclusion-reversing 
bijection. 

Next we show that every subgroup containing a norm group is a norm group. Sup- 
pose N;, © N; we show N is a norm group. We have that ¢,/;~% maps N isomorphically 
onto G(L/K'), where K’ = L%/*(%), the fixed field of ¢z/x(N). Consider the following 
commutative diagram from Theorem [27.4. 10} 


Ag 7) 


ss, 


G(K'/K). 


From this we find 
N= ker(@x1/K) = Nr 


as needed. 

Finally, we show Nyapy = Nz- Ny. Note LOL’ is the largest extension contained in 
both Z and L’, while N;- Nz, is the smallest group containing both Nz; and Ny), and it isa 
norm group by the above. Since L +> Ny is an inclusion-reversing bijection, we must have 
Nin = Ni: Ni. 


4.2 Class formations for local class field theory 


As promised, we apply the results of the last section to (G(K/K), K) where K is a local 
field. (In the global case we will set A to be the ideles instead.) 


Theorem 27.4.14: tim:ct-ciass-form Let L be a local field. Then 
(G(K/K),{G(L/K) : L/K finite Galois} , K) 

is a class formation. 

Proof. We verify the axioms of class formations. 


1. H'(L/K) = 0 for every cyclic extension of prime degree, by Hilbert’s Theorem 90 


(26.1.1). 
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2. Take the composition of the isomorphism H?(K) = H?(K™/K) of Theorem |27.3.1 
with the invariant map H?(K™/K) > Q/Z to get 


invk : H?(K) > Q/Z. 


The maps invy;x« : H?(L/K) @ H?(K) > Q/Z are isomorphisms onto their image, 
which much be wKZ/Z. 


Now we verify that 
inv, oResx/p =ninvk, n=([L: Ki). 


This follows from the following commutative diagram. From Theorem ]27.2.3| the 
right square commutes; from the fact that inflation commutes with restriction (by 
functoriality), the left square commutes. 


inv ur /K 


H?(K) —@ H?(K"/K) —+Q/Z 


[a 


Resi /1 Resx/1 n 
H?(L) BE H?(L™/L) —*Q/Z. 
(Note that the target of the restriction in the middle is H?(K™/L), which is a subgroup 
ot A" /0),) 


Applying results about class field theory, we get the main results of local class field theory, 
restated below. 


Theorem (Local reciprocity law, Theorem |24]24.2.1): For any nonarchimedean local field 
K, there exists a unique homomorphism 


dx: KX = G(K*®/K), 
called the local Artin (reciprocity) map with the following properties. 


1. (Relationship with Frobenius map) For any prime element 7 of K and any finite un- 
ramified extension L of K, ¢x«(7) acts on L as Froby/« (7). 


2. (Isomorphism) Let p; be the projection G(K??/K) + G(L/K). For any finite abelian 
extension L/K’, ¢« induces an isomorphism @,/« : K*/Nmyz/«(L*) > G(L/k) mak- 
ing the following commute: 


KX —_™ 4 G(K*/K) 


| [ 


K* / Ninpjxe(L*) 2 G(L/K). 


~ 
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3. (Compatibility with norm map) For any K C k’, the following diagram commutes. 


K'™ OK! G(K” /K’) 


Nm /K e| Kab 


K* * G(K*/K) 


Proof. By Theorem |27.4.14| (G(K/K),{G(L/K) : L/K finite Galois} , K) is a class forma- 
tion. By the Abstract Reciprocity Law applied to Ax = K, we thus have an isomorphism 
K*/Nmp/K L* => G(L/K)*. These maps are compatible by the first and fourth diagrams 
in Theorem 

Next we show that @x(m) acts on L as Froby;x. For the first, we use Theorem 
which says 

X(¢1/K(7)) = inv« (7 Udy). 

We calculate the invariant map on 7 U dy, recalling that the map H'(G,Q/Z) > Q/Z is 
evaluation at the Frobenius: 


H?(L/K) ———-> H*(G, Z) ———_ H1(G,, Q/Z) ———> Q/Z 


™ Udx —— v(m) Udx = 1U 6x ¢ lux x(Frobz/x). 


Thus x(¢r/«(m)) = x(Frobz/x) for all characters x on G(L/K), and ¢p/«(7) = Frobz/x. 
We will prove uniqueness in Section 


Proof of norm limitation, Theorem This follows directly from Theorem |27.4.14) and 
Theorem |27.4.11 


5 Examples 


Before we move on to the existence theorem, we seek to understand the reciprocity map a 
bit better. 


5.1 Unramified case 


The reciprocity map is easiest to understand for unramified extensions. 


Example 27.5.1: cx:uramifiearee Suppose L/K is an unramified extension of local fields of 
degree n (possibly infinite). Then the reciprocity map is 


byw: K*/Nmype(L*) & K*/n™UK > G(L/K) 


a+} Frobr7x . 
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Proof. There are many ways to see this. We know that any uniformizer maps to Frobz x. 
But the uniformizers generate K™~, so @,/x must be the map a> Frobyy. As Frobz/x has 
order n, the kernel is "2U x. 

Alternatively, in the proof of Theorem above, run the argument with arbitrary 
a instead of 7. 


5.2 Ramified case 


To understand the reciprocity map on ramified extensions, we have the following. 

Proposition 27.5.2: prunitto-inertia For any Galois extension of local fields L/K, 
orjx(UK) C I(L/K), 

where [(L/K) is the inertia group. 


Proof. By Theorem |15}15.7.2) L//*) /K is the maximal unramified subextension of L/K, 
so Ux C ker(@,12/x)/) from Example }27.5.1} This means that ,/«(Ux) projects trivially 
on G(LIV/N/K), i.e. brjx(Ux) C I(L/K). 


In fact, the reciprocity map relates filtration on the unit group Ux with the filtration on 
ramification groups (cf. Section |22/4.2), so Proposition |27.5.2| is just the beginning of the 
story. 

Theorem 27.5.3: The reciprocity map transforms the filtration 
K* / Nmzjx(L”) > Ux/ Nmzjx(Uz) D UP / Nmiyjx(UP) D --- 


into the filtration 
G(L/K) D Gs i(hiK) 2 G(L/K)' Dene, 


Proof. This uses more about local fields and local symbols than we’ll prove. See Serre [Ser79}, 
Chapter XV or Neukirch |[Neu99], V.86. 


Example 27.5.4: For the totally ramified extension Q,(¢, )/Q,, the reciprocity map sends 
p’(1 + (p")) 4 G(Qp(G=)/Qo(Gp)). 
The RHS is the rth upper ramification group G". 


Explicit computation of the reciprocity map in the ramified case is difficult without 
Lubin-Tate Theory. 
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6 Hilbert symbols 


sec:hilbert-symbo! TO prove the existence theorem, we need to show that every closed subgroup 
of G occurs as a norm group, i.e. as the kernel of some Artin map ¢,/%. To do this, we 
explicitly construct field extensions L/K that give these norm groups. We will construct 
Kummer extensions, extensions that come from adjoining an nth root. We focus on these 
extensions for several reasons. 


1. Recall that we don’t have a way to directly calculate the action of @z/%. Instead, we 
calculate indirectly by Theorem 27.4.9} If we know y(¢z/«(a)) for all characters on 
G(L/K), then we have determined ¢,/% (a). 


An easy source of characters comes from Kummer Theory |26]26.2.2| since the group 
of characters is isomorphic to a cyclic group.® 


2. We want to show that certain subgroups of norm groups are also norm groups. After 
verifying several topological properties of ¢«, we can reduce this to a statement about 
pth powers/roots of norm groups. In the abstract existence theorem [27.7.2 properties 
1 and 3 are easy to check; they are basically the reductions that allow property 2 to 
be sufficient. 


Recally from Proposition [26]26.2.2| that K*/K*" = Hom(G(K°/K), un). Thus the charac- 
ters we get are in bijection with elements of kK*/K*". We can also consider a € K™ as inside 
K*/K*", and this gives us a sort of “duality”: the Kummer pairing. We will see eventually 
that this is the source of reciprocity laws (Section 29] 1), so these symbols are good for more 
than just proving the existence theorem. 

We assume throughout that K contains a nth root of unity, and char(IX) { n. 


Definition 27.6.1: Let G = G(K*/K). Define the local symbol 


(, nt H°(G,Q/Z) x H°(G,K) — H*(G, KK) =Brx 
Kx 


(x, b) = BU dx 


Here 6 is with respect to the exact sequence 0 > Z > Q — Z/Q > 0 and 6 is the image of 
Ke ine (G, Kh), 
We will drop the subscript n when the context is clear. 


Since cup product is bilinear and 6 is linear, (, ) is bilinear. If A is local, by Theo- 
rem |27.4.9| we have for any Galois L/K and any character y on G(L/K), 


eq:invk-cale INV K(X, Ox/K (@)) = invK(aU bx) = x(¢z/K(a)). (27.5) 


As promised, we now transfer this action to K*/K*". 


®° Artin-Schreier theory, from exercise [26]2.1, is another source of characters. 


390 


Number Theory, 827.6 


Definition 27.6.2: Suppose K is a local field, and let G = G(K*/K). For a € kK”, define 
the character as in Proposition |26]26.2.2| by 


a 1 
yal) = 222, we (G, 44/2) = HG, tn), 


where G = G(L/K) and L = K(a~). Here we choose a root of unity ¢ to make a correspon- 
dence =Z/Z = pin. 
Define the Hilbert symbol by 


1 
K* x K* > Brx[n] = —Z/Z & pn, 
n 
(6,0) = (0) — BU oR, 


If K is a global field, let (a,b), denote the Hilbert symbol where a,b are considered as 
elements of Ky. 


Note that the image is in Z/Z, not just in Q/Z, because nxq = 0. 

We'll abuse notation and not make a clear distinction between Brg{n] = +Z/Z & pn, 
where Brx|n] denotes the n-torsion subgroup of Brg. The first isomorphism is given by 
invz and the second by 4 +> ¢. We transfer the y, from being defined on jz, to 4Z/Z, then 
transfer back from Brx[n| = +Z/Z to ju, at the end, so we may as well use the formula 
for the y, treated in H'(G, p,). 


The following relates the Hilbert symbol to the Artin map. 


Proposition 27.6.3: pr:hilbert-explicit We have 


(a,b) = [or/K(b)]( </a) 


a 


where L = K( <a). 


Proof. Formula (27.5) gives (remember we’re identifying Bre = 4Z,/Z = Ln; by abuse of 
notation we drop the “inv,” because it is an isomorphism) 


[ox (b)|(X/a) 


(a,b) = (Xa; bx/K(0)) = Xalbz/K(0)) = 


S 


where L is any field Galois over K, containing ¢/a. 


Theorem 27.6.4: thm:nitbert-bitinear The Hilbert symbol descends to a nondegenerate skew- 
symmetric bilinear map 
K*j/K" x RIK = fi, 
satisfying the following. 
K(az)*). 


1. (a,b) =1iffbe NM lady sic! 
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2. Ifae K*,x2€ K*, and <2" —a#0, then 
(2° =o)= 1. 
In particular, (a,—a) = 1 = (a,1 — a). 


Proof. Everything that went into defining (,) was linear in either variable (cup products, 
evaluation homomorphisms, snake lemma morphism), so (, ) gives a bilinear map K* x K* — 
Bins 

Suppose x is an element of order n. Then its kernel ker(y) has index n in G(K°*/K). 
Under the Artin map this corresponds to a extension L, of degree n, such that ker(y) = 
bx (Nmz,/x(LY)). Then 


(x, b) = x(x (b)) =0 = > ox(b) € ker(x) 


iff be Nmyz,/«(LY). 
We apply this to y = ya. Note that y has order [K (a) : Kk} and Xa(G(K*/K(an))) =0. 


Hence ox(Nm,, (K(az)*) C ker ya. By comparing indices in G(*/K), equality holds, 


(az)/K 
giving the first item. 


For the second item, note that 
n—-1 4 
x" —a= |] (- Gat) 
j=0 
(for any choice of nth root). The factors in the product can be grouped into conjugates over 
K, so 2” —a is a norm from K(a")/K. Then (a,2" — a) = 1 from the first item. Setting 
x = 0,1 gives (a, —a) = 1 and (a,1 —a) = 1. 
To show skew-symmetry, note from item 2 and bilinearity that 


1 = (ab, —ab) = (a, —a)(a, b)(b, a)(b, —b) = (a, b)(b, a). 


To show nondegeneracy, suppose b € K™* such that (a,b) = 1 for all a € K*; we show 
b€ K*". The condition (a,b) = 1 translates into ya(@«(b)) = 1 for all a. Now the image 
of @x is dense in G(L/K)? (because it is surjective for every finite extension L/K, and 
G(L/K) has the profinite topology). Hence x, = 0. This means an eK ig. ae k™. 


Corollary 27.6.5: cornitbert-tocai Suppose K is a local field, K (ax) /K is unramified, and 6b is 
a unit in kK. Then (a,b) = 1. 

If K is a global field, then (a,b), = 1 in K, unless either a or 6 is not a unit in K,, or 
K(ax)/K is ramified (which happen at finitely many places). 


Proof. Since K(ar)/K is unramified, Ux C Nm, (K(an)*). The result now follows 
from Theorem [27.6.4 
The second part says that (a,b), = 1 if a,b are units in K, and K(an)/K is unramified, 


which is clear from part 1. 


(an) /K 
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Remark 27.6.6: In fact, (a,b) = i(va U x») where 7: H?(G,Z/nZ) > Brx. (See Serre, p. 
207.) This explains the symmetry better but takes more work to prove. 


7 Existence theorem 


sec:local-existence We show that the existence theorem follows from several further (topological) 
axioms on formations. We then prove that in local class field theory, these axioms are 
satisfied. 


7.1 Existence theorem in the abstract 


First, a definition. 


Definition 27.7.1: Let (G,{Gx}Kex,A) be a class formation. The universal norm 
group Dx of K is the intersection of all norm groups of Ax: 


Dr = a Nmyz/x« (Az). 
L/K 


Theorem 27.7.2 (Abstract existence): thm:abstract-existence Suppose that (G,{Gx}Kex,A) isa 
formation satisfying the following conditions. 


1. For every extension L/kK, the norm homomorphism has closed image and compact 
kernel. 


2. Let [p] denote the map «++ px on A. For every prime p, there exists a field kK, such 
that for K containing K,, ker({p]|4,.) is compact and im([p]|4,.) contains Dx. 


3. There exists a compact subgroup Ux of Ax such that every closed subgroup of finite 
index in Ax containing Ux is a norm group. 


Then a subgroup of Ax is a norm group iff it is closed of finite index. 


If the conclusion holds, nAx C Dx for every K, because nAx is closed of finite index 
and hence a norm group. Conversely, Dk C f,3;nAxK because every norm group N has 
finite index so n kills Ax /N for some n. Furthermore, Dx must be divisible: else we could 
find a norm group N D Dx, and n such that nN DZ Dx, even though nN is still of finite 
index. (Note we write Ax additively here, but in class field theory, Ax = K and nAx 
actually means Aj.) The most important condition is item 2, because it will give us these 
two conditions. This gives us a large set of norm groups, and items 1 and 3 (which are more 
topological in nature) will give us the rest of the desired norm groups. 


Proof. Step 1: Suppose axiom 1 holds. We show that for every extension L/K, Nmz/« (Dz) = 
Dr. 
By transitivity of norms, Nmp/x«(Dz) C Dr. 
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Conversely, suppose a € Dx. Since a € Dx, for any extension M/L, Ay contains an 
element b such that Nmyjx«(b) = Nmz/x Nmyjz(b) = a. Thus 


Su a Nmz jx (a) al Nmy/1(Am) 


is nonempty. Since Nm has compact kernel, the first group is compact; since Nm has closed 
image, the second group is closed; thus Sj, is compact. Since the Sy, for all M/L form 
a directed system of compact subsets, S = (\,j, Si is nonempty. Any element of S is an 
element of Nm;z x(a) NM Dy. This shows a € Nmz/x«(Dz). 


Step 2: Suppose axioms 1 and 2 hold. We show Dy, is divisible and 


Dr = a nAK. 


n>1 


First we show that for every prime p, pDx = Dx. Let L be a field containing K,, 
a € Dx, and set 
St = [p)*(@) A Nmp;x At. 


Since [p]~'(a) is compact (as ker([p]|4,.) is compact by axiom 2) and Nmz/x Az is closed, 
Si, is compact. Now this set this nonempty: since a € Dk = Nmz/x Dz by step 1, we can 
write a = Nmyz/x 2, x € Dy,. By axiom 2, x = py with y € Ax, sob:=Nmyxy € Sz. Then 
N12K, S; is nonempty as in step 1. Hence a € pDx. 

This shows pDx = Dx, and we get Dk =(Mns1.NDK CMns1 MAK. 

For the other direction, note that na is the norm of any extension of degree n, so 
Mn>1 nAK io Dr. 


Step 3: Assume all the axioms. We prove the theorem. 

First, note that any norm group is closed by axiom 1, and has finite index by the reci- 
procity law [27.4.8] Indeed, by transitivity of norm, it suffices to consider Galois extensions, 
and the reciprocity law says Nm; (Az) has index equal to G(L/K)*». 

Conversely, suppose S' is a closed subgroup of finite index n. We will find a norm subgroup 
contained in S and then apply Theorem Since Ax/S has order n, we get De GC 
nAr CS,so 


() (NUK) =DKNUK CS. 


N norm group 


Since NM Ux are compact (NV is closed and Ux is compact) and S' is open (closed subgroups 
of finite index are also open), there exists N such that 


NOUK CS. 


Note Ux + (NS) is closed of finite index in Ax because N, S' are closed of finite index; 
we show we can replace Ux with Ux + (NMS) above: 


NA(Uk+(NNS)) CS. 
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Suppose a € Ux and a’ € NOS such that a+a’ € N. Thena € N, but NN Ux C S' so 


a € 5 as well. Thus a+a’ € S, as needed. 


Now NN (UK + (N1S)) is is closed of finite index containing Ux, so is a norm group 


by axiom 3. By Theorem |27.4.13| we get S is also a norm group. 


7.2 Existence theorem for local class field theory 


Proof of Theorem We verify that the class formation for LCFT satisfies the three 


axioms of Theorem [27/27.7.2 
1. To see that the norm map is closed, note that 


Nmz/x(L”) NUK = Nmz/«(Uz) 


because an element is a unit iff its norm is a unit. As Uz; is compact and Nmz/x 
is continuous (Proposition 21}21.1.6), Nm,/«(Uz) is compact and hence closed. Now 
Nmz;x(L*) is a union of translates of U;, therefore closed as well. 


The kernel of Nmz/x is a closed subset of Ur, hence compact. 


. Take K, containing all pth roots of unity. The kernel of the pth power map is the pth 
roots of unity, which is a compact set. Suppose K > K,, and let b € Dx be a universal 
norm. Then (a,b) = 1 for all a by Theorem[27.6.4] Since the pth power Hilbert symbol 
is nondegenerate on K*/K*?, a € K*?. Thus De C K™?, 


. Take Ux to be the group of units of K*. The closed subgroups of finite index containing 
Ux are just t’“Ux for n 4 0; these are the norm groups of unramified extensions of 
degree n by Proposition |27.5.1| (Note these extensions exist—just adjoin appropriate 


roots of unity.) 


Proof of Theorem This follows from Theorem |27.4.13} Theorem |27.4.14] (class for- 


mation for LCFT), and the existence theorem just proved. 
Note the existence theorem gives the following. 


Corollary 27.7.3: coruniv-norm1 The universal norm group Dx is {1}. 


Proof. All open subgroups of finite index are norm groups by the Existence Theorem ]24]24.2.3 


The intersection of all open subgroups of finite index is {1}, aS Nmnn(1+(a™))a™ = {1}. 


8 ‘Topology of the local reciprocity map 
We now prove that @x gives a topological isomorphism K* > W(L/K). 
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Proof of Theorem|24]24.2.4, By Proposition |27.5.2| @z/K(UK) C I(L/K), so we have the 


commutative diagram 


1 >UK » k* “ »Z >1 


bw ae 


1—> I(L/K) —> G(L/k) —> Gi(l/k) —> 1. 


where the rightmost vertical map sends 1 to the pth power Frobenius (p = |k|). The vertical 
maps factor as 


eq:decomp-K/nmL 1 —? UK / Nmz/« (Uz) — > Gar Nmz/x(L*) =e Z/fZ —> 1 (27.6) 


=| 61/1 =| 61/1 =| 


1 ———_> I(L/k) —————> G(L/k) ————> G(I/k) —> 1. 


where f = [I : k]. Recall ¢% = lim, ¢1/x- The intersection of all norm groups is {1} by 


Corollary |27.7.3, so dx is injective on Kk”. 


In forming ¢x = lim éL/K, We are really considering the embedding 


K* > K* := lim K*/ Nmzx(L”) > G(K°/K). 


Decomposing K*/ Nmz/x«(L*) as in (27.6), we have that 
1. lim | Ux/ Nmz/x(Uz) = Ux since Ux is compact, hence complete, so Ux = Tie Kk). 
2. lim, Z/fZ =Z. 

Thus KX <> K® is the embedding Ux x 7“ 4 Ux x rh. 
Recalling that W(L/K) is the inverse image of Frob” C G(k/k), we get dzjK : KX > 


W(L/K) is a topological isomorphism. In summary, we have the diagram 


1 Uk » K* » 1 


=|ox =| 6x = 


1 —> I(K*>/K) —> W(K**/K) ————> Frob* ————> 1 


| | 


G(K*>/ kK) —— Frob® = G(k/k) —>1 
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8.1 Uniqueness of the reciprocity map 


sec:lcft-uniqueness Finally, we prove uniqueness. This finishes all the proofs of local class field 
theory. 
We first restate Lemma |24]24.6.7 


Lemma: Suppose that K is a nonarchimedean local field, AK is the maximal abelian unram- 
ified extension of AK, and L is an abelian extension containing A™. Let f : K* > G(L/K) 
be a homomorphism satisfying (1) and either (2) or (2)’: 


1. The composition k~ Es G(L/K) + G(K™/K) takes a to Frobgur/x (7). 
2. For any uniformizer 7 € K, f(m)|x, =1, where 


Ky = LIK), 
2’. For any finite subextension K'/K of K,, we have 
f(Nmxi/x(K)) |x: = {1}. 
Then f equals the reciprocity map ox. 


Proof of Lemma It suffices to prove this for L = kK”. We have the split exact 
sequence 
gapleteanig | — Ux — K* = Z, —> 1, (27.7) 


where the splitting is determined by the map Z — K™* sending 1 +} 7, and the map 


k* — Ux sending a ++ —%y. Under the Artin map, (27.7) gets sent to the split exact 
sequence of topological groups 


1 > I(K*®/K) = G(K/K™) > W(K*®/K) > W(K™/K)=Z—1 
by Theorem |24]24.2.4| This gives the exact sequence 
1> G(K™"/K™) — G(K®/K) > G(K"/K) > 1, 


where the splitting is by the map Z & G(K™/K) > G(K*”/K) sending 1 ¢x(z). This 
identifies G(K?>/K™) with the quotient group G(K,,/K) where 


K, = Liee(™)) = Lox). 


If (2)’ holds, then for any uniformizer 7, we have that 7 € Nmx-/«(K"™) for every finite 
subextension K’ of K,. Then (2)! gives that f(7)|«, = 1. Then (2) holds. 

We now show if (1) and (2) hold, then f = ¢. Indeed, (1) and (2) imply that ¢(7)|Kuk, = 
f(m)|Kurx, for any uniformizer 7. But K™K, = K?> and the set of uniformizers generate 
K™* (any unit is the quotient of two uniformizers). Hence ¢ = f. 
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Proof of uniqueness in Theorem |24124.2.1, Suppose ¢’ is another map satisfying the condi- 
tions of Theorem |24]24.2.1| It suffices to show ¢’ satisfies the conditions of Lemma|}24|24.6.7 


with L = K*». By assumption it satisfies (1). For condition (2)’, we have x (7)|x, = 1 by 
definition of K,. Hence 7 is a norm from every finite subextension of K,. By condition 2 of 


Theorem |24}24.2.1} this shows Ger (Nmx/K(K")) = {1} for every subextension K’/K of 


L, as needed. Hence ¢' = ¢. 


Problems 


1. Using ¢x, construct a natural bijection between the following two sets. 


e continuous characters W(K/K) — C% (i.e. continuous representations W(K/K) > 
GLi(C)). 


e continuous character K* — C (i.e. continuous homomorphisms GLi(Kk) > 


GL(C)). 


This is the “local Langlands correspondence for GL, over kK.” Local class field theory 
generalizes more naturally in this form. 
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Chapter 28 


Global class field theory 


ect To prove the global reciprocity law we need to do two things, namely construct a map 


OK : ii Nmyz/x TD = G(L/K), 


and show that it is an isomorphism. To show it is an isomorphism, we need to show that 
the two sides have the same cardinality:' 


| K/K* Nmyp/K L| = [L : K\. 


The first inequality “>” will be shown using cohomology, with lots of Herbrand quotient 
calculations. The second inequality “<” is most easily shown with Z-functions, but can also 
be shown with a more complicated cohomological argument. 

To construct a map, there are two approaches. We can define ¢x to be the map whose 
components are the local Artin map, and use the properties of the local Artin map given 
by local class field theory. Alternatively, we can construct it directly in the global case, 
without using local theory, and get local class field theory as a corollary. We will take the 
first approach. For an account of the second, see Lang [Lan94]. 


1 Basic definitions 


First, some basic definitions. 


Definition 28.1.1: Define the action of G(Z/K) on I, by permuting the places: For an 
idele a = (d,)yev,, define oa by 
(ca)ow) = o(a,). 


Definition 28.1.2: Define the inclusion map Ix @ Iz, by 


(av )vevic > ((dv)wle) reve? 


‘More precisely, we use this to show the invariant map is an isomorphism, then get the Artin map from 
the machinery of class formations. 
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i.e. it is induced by componentwise inclusions K, @ L”. Let the inclusion map Cx @ Cy, 
be induced by the above inclusion. 
For an infinite extension M/K, define 


m= li is Cu= ili Cr 
KCLOM KCLOM 


where the limit is taken over finite Galois extensions L/K. 


For short, let H"(L/K,A) denote H"(G(L/K), A) and H?(K, A) := H"(K/K,A). As 
in the local case, H"(L/K) denotes H"(G(L/K), K*). 


Proposition 28.1.3:  prigisix Let ee be a Galois extension and G = G(L/K). The inclu- 
sion map Ix — I sends Ix =e ‘J © and the inclusion map Cx <> Cz sends Cx ay Cr. 


Proof. The first part holds because G acts transitively on all the places in L dividing a single 
v € Vx, so any element of I¢ has to be constant on all w | v, i.e. in the image of Ix. 
For the second part”, take the long exact sequence in cohomology associated to 


13 L* 3 LtoCcr,ol 


to get 


1 —+ H°(G, L*) —~ H°(G,1,) —~ H°(G, C,) —> H1\(G, L*) 
K 


where the equality on the right is Hilbert’s Theorem 90 eee 26]26.1.1) and the map 
Kk — C@ is induced by inclusion. Thus C¢ = Ix/K* = Cx. 


2 The first inequality 
In this section we will prove the following. 


Theorem 28.2.1 (First inequality of global class field theory): ‘rst-incquaiity If L/K is cyclic, 
then 


| K/K* Nmp/K L| z [L : K}. 


To prove the inequality, we first express the left-hand side in terms of cohomology. Letting 
G = G(L/K), we know that 


Hy(G, Cr) = Cx/ Nmz/x Cr = 1x/K* Nmz/x 


Sy 


?which isn’t obvious, because we’re taking quotients here 
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Then noting that the Herbrand quotient (with respect to G) of Cy, is h(C;) = ee 
yd ? 
we have that 
aqet-ineq-berbrand| nik” Nmy/K L| = |HZ(G, Cr)| > h(C,z). (28.1) 


To calculate h(C_;) our plan is as follows. 


1. First express C, in terms of something involving a finite set of places; we find T' so 
that 


x7 


i : _ h(I?) T 

2.2). = 

(Proposition |28.2.2). Then calculation shows that h(C_;) RUT) where U;, denotes 
L 

the T-units in L. 


2. Compute h(I?) = [],¢5 nv. Note I? is a direct product, not a restricted direct product, 
so we can just take the product of the Herbrand quotient of the factors. Breaking up 
the places into G(L/K)-orbits, we can calculate h(I?) using the corollary to Shapiro’s 
Lemma [25]25.8.7/ 


3. Compute h(U?) = +Tl,eg Mv by relating it to a lattice of codimension 1 in R* by the 
log map, where s = |S|. (See ANT, Chapter [18}) We use the fact that the Herbrand 
quotient of a full lattice depends only on the vector space it resides in (Theorem 28.2.5) 
to change to a more convenient lattice whose basis consists of vectors representing the 
s places in U?, i.e. the lattice A = T],,c5 Zew. 


The set S breaks up into G(L/K)-orbits, so the lattice breaks up into induced S- 
modules, and we can calculate h(U?) using again using Shapiro’s Lemma |25]25.8.7 


4. Putting all the steps together gives 
h(Cz) =ESn, 


as needed. 


2.1 Reduce to finite number of places 


Proposition 28.2.2: inis Let L be a number field. There exists a finite set of places T of L 
such that 


x7 


Proof. This basically follows from the finiteness of the class group. 
For the first part, consider the map p: I, — Clz, defined by sending 


(Qoev, > TI p%. 


v=vpEVP 
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(Map a to the prime ideal whose valuation at each prime equals the valuations of the cor- 
responding coordinates of a.) The kernel—the set sent to the principal ideals—is L*If~, 
where V@ is the set of infinite places. Thus we have an isomorphism I;,/L*I¥~ — Cl’. 
The latter is finite; take the inverse image of a set of generators A. We can choose finite 
T containing V© so that the coordinates of elements of A are units outside of T. Then If 


generates I, /L™*, as needed. 


2.2 Cohomology of I? and I; 


Proposition 28.2.3: nic Let a be a Galois extension of number fields. Let S be a set 
of places in K and let I? := If where T = {w € V;: w |v for some v € S}. Then for any 
1 > 0 we have 
HG) = | Pew / kyl) x Tl BG Ke) .0). 
ves ves. 


This is also true for Tate groups if G is finite. 
In particular, if L/K is cyclic, and S' contains all ramified places, then 


H'(G, . =1 
HG 1? =e —2/2 
ves | 
h( 2) = Il Ny 
ves 


where ny is the local degree [Ly : K,], for any w | v. 


Proof. We have 


where U,, := Ux,,. We calculate the cohomology groups of each factor. 


H (<. Il vi) =H" [« Il 114 


welt veES wiv 


= Il ely [« Il vs) cohomology respects products, Proposition }25]25.6.7 


ves wiv 
=| ne ,L) by Corollary |25]25.8.8]to Shapiro’s Lemma 
ves 
eq:hiltl = = |] H'(G G(L"/Ky Ne Le) (28.2) 
ves 
Cy iA, 
eq:hilt2 = 7 (28.3) 
Mucs £Z/Z, i=2. 


3cf. Example |24.5.10} there IY” is written as Ur. 
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For 2 = 1, the last result follows from Hilbert’s Theorem 90, and for 7 = 2, it follows from 
the fact that invx, : H?(G(L/K,),L’*) — +Z/Z is an isomorphism (a consequence of 
the class formation for LCFT, Theorem |27]27.4.14| or actually just Theorem |27]27.3.1) and 


Proposition |27]27.2.1). 


For the units, we have, 


eq:hilts HT” [« Il v.| = |[ A (G(L"/K,), Uw) Proposition [25]25.6.7 


weT v€S 
(28.4) 
eqhiita = 1 if T’ unramified, by Theorem |27]27.1.1 
(28.5) 


For the general case, take the product of (28.2) and (28.4). For the special case, take the 
product of (28.3) and (28.5). The Herbrand quotient calculation follows directly. 


If we consider the full group I, we get the following result. (We won’t need this until 


Section [5}) 


Proposition 28.2.4: prniasproa For any Galois extension L/K with Galois group G and any 
n > 0, we have 


H"(G,1,) = @ A"(L’/K,). 
vEVK 
This is also true for Tate groups when G is finite. 
In particular, we have 


1. H(G,I,) =0. 
2. H°(G,In) = @revy 2Z/Z. 


Proof. We have 


Lb lim ie 
S' finite 


Hence using Proposition }25]29.14.3} 


H"(G,1,) = H"(G, lim I? 
( ) ( Hm I) 


= lim H"(G, 15) 
{ira , Tues H"(G", £°*) x Togs H"(G",U") = Brev, H"(G", L’*), general case 
= 41, n=1 
{© 1Z/Z, p=2 


veEVK Ny 


where the last statement follows from Proposition |28.2.3 
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2.3. Cohomology of lattices and U? 


Proposition 28.2.5: cohomattice Suppose G' is finite cyclic, V is a finite real vector space and 
R{G]-module, and M, N are two lattices in V, stable under the action of G. Then 


h(M) = h(N). 


(If one is defined, so is the other.) 


Proof. We proceed in 2 steps. 


Step 1: We show that M &zQ = N ®@zQ as G-modules. We know M ®zR=V=WN @zR. 
Suppose V = R”. Choose bases {(;} for M and {7} for N. Let B(o) and C(c) be matrices 
representing the action of a generator o € G' on these bases.* A linear map M@zR > N®zR 
represented by a matrix A with respect to {6;} and {y;} is a isomorphism of G-modules if 


A-B(o) =C(a)-A. 


These determine a system of homogeneous linear equations in the entries of A, with coeff- 
cients in Z, since B(a) and C(c) have entries in Z. 
Letting the solution space be W C M,,x,(IR), we have 


dime W = dimg(W mM) Mrxn(Q)), 


because Gaussian elimination never needs to leave the world of Q. Hence we can find a 
basis for W contained in Mnxn(Z), say {A1,..., Ax}. By the existence of an isomorphism 
between M ®z R and N @zR, there exist a,,...,a, € R such that a,A,; +---+ a,A,x is 
nonsingular, i.e. 


det (a, A, apne Se apAr) a 0. 


The left hand side is hence a nonzero polynomial in the a,; since it has coefficients in the 
infinite field Q it has a solution over Q. Taking A to be the corresponding linear combina- 
tion, we get the desired G-isomorphism M @z Q—-> N @z Q. 


Step 2: We have an isomorphism f : M @zQ > N &zQ; by scaling f (since M,N are 
finite-dimensional lattices) we may assume f restricts to f : M — N. Now N/f(M) is 
finite; hence by Proposition |25}25.12.4(1) and (2), 


h(N) = h(M)ACN/F(M)) = AM). 


Proposition 28.2.6: iu: Let L/K be a finite cyclic extension of number fields of degree n. 


Let S' be aset of places in K containing the infinite places and T = {w € V;: w |v for some v € S}. 


We have i 


welt 


4G cyclic is not important here; we could work with all elements of G. 
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where U? denotes the T-units in L and n,, is the local degree [L,, : K,], where w | v. 


Proof. Consider the map L : U? — R® defined by letting 


L(a) = (ln |alw)wer 


where | - |, is the normalized valuation. Then L(a) is a lattice of dimension |7'| — 1 by 
Dirichlet’s S-unit theorem |18/18.3.2; it is in the hyperplane where the sum of coordinates 
is 0 (take the log of the product formula [20]31.1). The kernel of LZ consists the roots of 
unity in L, 2 L, which is a finite group. By Proposition |25]25.12.4(1)-(2) applied to 
13> pNL- Uf = L(UF) 5 0, 


taeah(U7) = h(n L)AG(Uz )) =n L(U7)) (28.6) 


Let G(L/K) act on R? by permuting the coordinates corresponding to the places. Note 
that LZ is a G-module homomorphism with respect to this action. Let x be the vector 
(1,1,...,1); note it is fixed by G(L/K). Note that 


A:= L(U7) @ (1,1,..., DZ 


is a full lattice in R?. By Proposition |25]25.12.4(2)-(3), we have 


vineg-2h(A) = h(L(UZ))h(Z) = n- h(L(UZ)). (28.7) 


Consider the lattice A’ = Z? in R’, where e, is the vector with 1 in the v position and 0’s 
elsewhere. By Proposition |28.2.5| h(A) = h(A’). Since G permutes the places above v € S 
transitively, we have 


h(A) = h(A’) =h (@ out) 


welt 


=h (@ Dez) 


ves wiv 


Il h (@ “) cohomology respects products, Proposition |25]25.6.7 
ves wiv 


= |[ A(G",Z) by Corollary |25]25.8.8] to Shapiro’s Lemma 
ves 
=|[|c’ Proposition [25]25.12.4(3) 
ves 
= ][ 
ves 
Together with (28.6) and (28.7), we get 
1 1 
h(UP) = —h(A’) = — [I mw 
(UB) = {m8 = 7 Tn 
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2.4 Herbrand quotient of C,; 
Lemma 28.2.7: ni If L/K is a cyclic extension of number fields of degree n, 
h(Cz) =n. 


Proof. Choose a set of places T for L containing the ramified places and satisfying the 
conditions of Proposition |28.2.2| Enlarge T so it is stable under G(L/K). Using Proposi- 


tions |28.2.3] and |28.2.6} we have that 


_ xq 7X) — pq qT x, AME) — These _ 
(Cr) = MLME/E") = MEE EY) = pre = Tp = 


Proof of Theorem We have 
«/K* Nmy/x(Iz)| = |H7(G,Cz)| = h(Cz)|Hp'(G,Cz)| 20 


by Lemma [28.2.7 


2.5 The Frobenius map is surjective 


sec-frob-surj Using the first inequality, we can already prove surjectivity of the Artin map, defined 
on ideals. 


Proposition 28.2.8: prtob-suxj Let L/K be a finite abelian extension, and S be a finite set 
of primes. Define the map 
WrjK i 1° > G(L/K) 


by setting w1/«(p) = Froby«(p) for primes p ¢ S and extending to a group homomorphism. 
Then wz/K is surjective. 


Proof. Let H = im(wz/%). By compatibility of the Frobenisus map, Frobg#/«(p) is the 
image of Froby«(p) under the projection G(L/K) + G(K"/K). Hence the map Uxu/K : 
I° + G(K#/K) is trivial, giving (K“)” = K, for every v ¢ S, and 


Ss 
K CNmxu x Ie. 


However, AK “I? is dense in Ix by the weak approximation theorem |20]20.3.4| so AK *I?, 
K* Nga x Ixu = 1x. But by the First Inequality |28.2.1} 


[K™ : K] < [Ix : K* Nmgij¢ Ign] = 1. 


Hence K¥ = K, ie. H =G. 
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3 The second inequality 


We give two proofs of the second inequality, an analytic proof and an algebraic proof. The 
first has the advantage of being short and sweet, while the second has the advantage of 
staying completely within the algebraic realm, i.e. not requiring knowledge of L-functions. 


Theorem 28.3.1 (Second inequality for global class field theory): tnm:2inca For any extension 
L/K of degree n, and G = G(L/K), we have 


1. |H2(G,Cz)| and |H?(G, Cz)| divide n. 
2. (HT90 for ideles) |H'(G,C,)| = 1. 


In particular, 


k/K* Nmy/K L| < [L : K}. 


3.1 Analytic approach 


We first show the inequality |Ix/K™ Nmz/x Iz] < [L: Ky}. 


Proof of inequality. Let ¢ be admissible for L/K, i.e. such that Ux(1,¢) C Nmz;x (Iz). By 
Proposition |24]24.5.9] we know that Ix/K* Nmp/« I, & Ip /Px(1,¢) Nmzjx(J;). We show 
that 


[I& : Px(1,¢) Nmz/x(£§)] < [L: K]. 


Let H = Px(1,¢) Nmp/x Ij and let y be a nontrivial character of Ij, /H, viewed as a character 
ot) / Pe (l,«): 
Define the Hecke L-series L.(s, x) by 


1 


x(a) 
L.(s, x) =|] = s, 
pte I= ae) alc Ta 


where equality follows from expanding the product. Define 
m(x) := ord,1 L,(s, x). 
Since L,(s, vy) = (s — 1)™g(s, x) for some g(s, x) nonzero at s = 1, taking logs gives 


1 
s— 1" 


In L-(s, x) ~ m(x) In(s — 1) = —m(x) In 
Taking the sum over all characters of aw gives 


(28.8) 


XA XA 


ec 
= 
=) 


sae teqemnal |T1 Cx(s) + S- In La(S, x) eS F > S- m(x) 
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where we use the fact that ¢x(s) := L(s,1) has a pole at s = 1. 
On the other hand, by the Taylor series expansion for In, 


In L,( — yon a-% we) Dae lee ys RA) 


s 
pie n=1 pte Rel‘ /H pes, pte p 


1 


where in the last step we grouped together the primes based on what they are modulo H. 
This is greater than the sum if we only include primes with f(B/p) = 1 (% in L). Again we 
are off by at most a constant if we only include primes splitting completely in L, because 
the ramified primes are at most a finite subset. We can then “unrestrict” to all the primes 
of L, and be off by at most a constant in a neighborhood of 1, because the other terms are 
in the form oft for f > 1. 


Let h = [15 : H]. We get, for s > 1*, 


In G(s) + S- In Lm (s, x) ~ ae S> x (8) S> 


ea xX RET*/H pER, p{m 


( 
i 0, RAH 
LOM +h Dp EXA= 1, 
peSpl(L/K) Mp x i R= A, 
h 1 
~O0)+— > =e 
N saya UB 


1 
Mps 


N primes above each p 


~ O(1) + —In¢z(s) 


~O(1)+—In 


Combining this with (28.8) gives m(x) = 0 (since a > 0) for all y 4 1, andh < N, as 
needed. 


3.2 Algebraic approach 


sec:2ineq-alg-proof The steps are as follows. 
1. Carry out some preliminary local computations. 


2. Consider the case where L/K is an extension such that G(L/K) © (Z/nZ)'", and K 
contains the nth roots of unity. Note this is a Kummer extension, so we can characterize 
it in terms of L*" 1 K. This will make computations easy for us. 


We construct an explicit set E with 


EC Nmyzx Iz Clk. 


We have [Ix : K* Nmzx Iz] | [In : K* E], so it suffices to show the latter equals n’. 
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3. Show this. 


4. This implies the cyclic prime case, and that the cyclic prime case implies the general 
case. 


This section is incomplete; see Cassels-Frohlich |CF'69|, pg. 180-185. 


3.2.1 Local computations 


Proposition 28.3.2: pr:local-power-index Let K be a local field with lun OK | a mM, 1.e. K contains 
m nth roots of unity. Then 


[KX : K*"] = 
[2]. 
and 7 
Ux : Uz| = —. 
[ K K| jn]. 


Proof. There are two methods: appeal to the structure of K* or calculate a Herbrand 
quotient. 


3.2.2 Constructing E 


Since Z is a Kummer extension we can write it in the form K(?/a1,..., ¢/a-). Let S bea 
set of primes satisfying the following conditions. 


1. S contains all infinite places. 
2. S contains all divisors of n. 
3. Ix = K*I}.. (This is possible by Proposition |28.2.21) 


4. S contains all prime factors in the numerator and denominator of all a;, i.e. the a; are 
all S-units. 


Define 
E=|[ky"x ][ «ox [[ &. 


ves veT v€SUT 


Lemma 28.3.3: EC Nmz/x Ir. 


We want to calculate [Ix : K*E] but K*E is hard to deal with. E however, is not, 
because to calculate the index of E we can appeal to Proposition |28.3.2| Thus we use the 
following group theoretic fact. 


Proposition 28.3.4: Let B C A and C be subgroups of a group G. Then 


[CA:CBI|CNA: CNB] =[A: Bl. 
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Then 


eth e=|h 7k es) = [SUT : E] 
: Kk: [Kx nm SUT KX) EB] 


See Cassels-Frohlich. 


3.3. Finishing the proof 


Now we prove Theorem ]28.3.1 


Proof of Theorem|28.3.1, Step 1: We show the theorem when [L: K] is prime. In this case, 
both the first and second inequality hold, so 


|H2(G, Cr)| = [ Ks K* Nmz/x«( L)| = [L 4 rae 


Since h(C,) = n by Lemma [28.2.7] we get |H7(G,C,)| = 1. Finally, note H2(G,Cz) = 
H?(G, Cy) because G(L/K) is cyclic. 


Step 2: We show the theorem when [L : K] is a prime power, by induction on the exponent. 
Suppose |G| = p”. Every p-group has a normal subgroup of index p. Let H <«G be such a 
group; it corresponds to H = G(L/K") for some extension K’/K of degree p. The inflation- 
restriction exact sequence |25]25.11.10) gives 


Inf 


0 > H(G/H, Ce) 5 A(G,C,) =  H1(H,Cz) 
—>———__—_—__S e——_ SS 
=0 by prime case =0 by induction hypothesis 
Thus H!(G,C,) = 0. This shows part 2. Using H'(G,C,) = 0, the inflation-restriction 
exact sequence gives 
0 H?(G/H, Cx) @ H?(G,C,) 2 H?(H, Cz) 


_—__ KE 
order p order |p™—1 


by the case for cyclic extensions and the induction hypothesis. This shows |H?(G,C,)| | p”. 
Finally, 


|H2(G, Cr)| = [Cx : Nmy/K C;] = [Cx é Nmxi/K(Cx:)|[NmxrK (CK) ' Nmz/«(Cz)). 


Now [Cx : Nmx1/«K(Cx:)] = p by the cyclic case, and the surjection Nm«/q : Cx / Nmpj« (Cr) > 
Nmxi/« (Cx) / Nmp/x«(Cz) and the induction hypothesis gives that the second factor divides 
p”'. This finishes the induction step. 


Step 3: We show the theorem holds in general, using Corollary |25]25.11.7} the map 
Res” : H"(G, M) > H™(G,, M) 


is injective on the p-primary component. Using step 2, for n = 1, this gives us that p { 
H7(G,C_) for any p, ie. Hp(G,Cz) = 0. For n = 0,2, this gives that v,(|H"(G,M)|) < 
v,(|H"(Gp, M)|) < up(G), giving part 1. 
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3.4 Local-to-global principle for algebras 


The fact that H'(G,C,) = 0 also gives the following corollary. 


Theorem 28.3.5 (Brauer-Hasse-Noether Theorem): tm-rn Let L/K be any Galois exten- 
sion with Galois group G. Then the map 


H?(G,L*) > @ H?(G",L"*) 


vEVK 


is injective. A central simple algebra over a number field K is split over K iff it is split 
locally everywhere. 


Proof. Taking cohomology of 0 — L* > I, + Cy, — 0 gives 


eqo-h-n H1(G, C,) —— H?(G, L*) ——> H*(G,1,) ——--- (28.9) 


Br“ DucVx Brx, 
Here H'(G,Cz,) = 0 directly from HT90 for ideles (Theorem |28.3.1), and equality on the 


right comes from 


HG lz) = GA (L"/K,) 
vEVK 
(Proposition [28.2.4). Brauer group is H? by Theorem [26.5.2] Injectivity of the bottom map 
gives the result. 
(We do need to check that in the above diagram, the map Brx > ®, cy, Brx, is exactly 
the map sending an algebra to its reduction over every local field. This is a matter of tracing 
the long windy road between Br and H? and left to the reader.) 


4 Proof of the reciprocity law 


To construct the Artin map in the local case, we constructed the invariant map invx : 
H?(K™/K) > Q/Z. Then we used the fact that H?(K™) = 0, i.e. every a € H?(K) splits 
in an unramified extension, to conclude that H?(K) ~ H?(K™/K). 

In the global case we will construct the invariant map invg : H?(K./K,Ix.) > Q/Z, 
for a certain infinite cyclotomic extension K,. Then we show H?(K,,Ig¢) = 0, ie. every 
a € H?(K,Ig) splits in this cyclotomic extension, to conclude H?(K, Ig) = H?(K,/K, Ig). 

We construct the global Artin map by taking the product of the local Artin maps: 


se tlk — G(L/K) 
brjK(a = || dy( dy) «eq:artin-as-product (28.10) 


vEVK 
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(Only a finite number of the factors—those where L”/K, is ramified or a, ¢ U,—are not 
equal to the identity.) 

We need to show that A* C ker @z/K, so that it factors through Ix /K* - Nmz/K 
Consider the following two properties. 


(A) Define the map @z/x« as in (28.10). The map ¢,/x% takes the value 1 on the principal 
ideles K* Cl. 


= 


(B) For all a € H?(G(L/K), L*) = Brz/x, 
inv(a@) := ‘> inv,(i(a)) = 0. 


vEVK 
Note in (B), inv, is defined as follows. 


Definition 28.4.1: argiobariny Define inv, as the following composition: 


H?(Gy,pu 
( ) 


inv, : H2(G,I,) 72 H?(G,,I,) H2(G,, (L”)*) ®4 O/Z 


where p, : I, > (L°)* is the projection map. (This looks complicated, but it is just what 
you think it is.) 


We prove (A) for all finite abelian extensions of number fields and (B) for all finite Galois 
extensions of number fields. 

We first show that (A) holds for a special class of extensions, and then use an “unscrew- 
ing” argument to show (A) and (B) hold for more general extensions. The plan of attack is 
as follows. 


1. Show (A) holds for Q(Gn)/Q. 

2. Show (A) holds for all cyclotomic extensions. 

3. Show that (B) holds for a split by a cyclotomic extension. 

4. Every a is split by a cyclic cyclotomic extension, so (B) holds for all a € H?(K, K ja 


5. Show that (A) holds for all abelian extensions. 


Note that (A) is a statement about H;? — H?. while (B) is a statement about H?. We 
“transfer” the problem from (A) to (B) so that we can apply our characterization of ¢, 
in terms of the local invariant map (Theorem [27]27.4.9). First, we need an analogue of 
Theorem in the global case. 


Lemma 28.4.2: lem:sum-inv Let G = G(L/K). For all VE Vx and all xX — H'\(G, Q/Z) a 
Hom(G, Q/Z), we have inv, (aU 6x) = Xu(¢v(av)). (Xv is the restriction of y to G, and @ is 
the image of a in H?(G(L/K),I,). Hence 


inv(aU dy) = $> inv, (aU dx) = x(¢(a)). 
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Proof. Since restriction commutes with cup products (Proposition |25]25.11.9) and with 6, 


we have 


inv, (aU dx) = inv(p, Resgja, (a U 4x)) 
= inv(p,(a) U dx) Rese/a,(x) = Xv 
= inv(G U dyn) = Xu(dv(av)). 


We invoked Theorem |27/27.4.9]in the last step. 
Taking the product gives the second statement: 


x(6(a)) = x (TT 4s(as)) = xo(Gela)) = Dinvy(@U dy) 


4.1 (A) holds for Q(¢,)/Q 
Proposition 28.4.3: For any m €N, 


beacm)/g(Q”*) = 1. 


First reduce to the case where m = p is prime. We give two approaches. 


Proof 1. By Example |24]24.4.3| we know the ideal version of global class field theory holds 
for all cyclotomic extensions of Q. Note the maximal unramified extension of Q, is included 


in Q,(¢..) for all p (Theorem [21]21.2.6). Hence by Theorem [24]24.6.6{2), there is a map ¢' 
satisfying the conditions of the idele version of GCFT, except that ¢, may not equal ¢p. 


Letting ¢/, be the restriction of ¢’ to K,, we have (on G(Q(¢~)/Q)) 


eqidele-ctt-eycotomich’ (a) = bg(ar) T] ¢4,(av) = T] do(av) = ¢'(a) (28.11) 


veVg vEVo 


where the middle inequality is pending a proof that @p = dp. We check this is true. 

Since dg is a map R/Rx») — G(C/R), it suffices to show complex conjugation is in the 
image of ¢’. We have G(C/R) = G(R(z)/R), so consider ¢’ on G(Q(i)/Q). As @'(Q*) = 1, 
we have by and the fact that Q(i)/Q is only ramified at 2 that on G(Q(i)/Q), 


1 = $'(—7) = $5(—7)67(—7) dp(—7) 


Now —7 = 1 (mod 8) so —7 € Nm@,(iy/e@, (Qa(z)*), and ¢3(—7) = 1. We have v7(—7) = 1, so 
@'(—7) equals the Frobenius element, complex conjugation. Hence ¢g(—7) is also complex 
conjugation. 


Thus (28.11) holds, and we have ¢g(¢..)/a(Q”) = %a¢¢..)/9(Q”) = 1, as needed. 


Proof 2. Use explicit computations of local symbols, obtained from Lubin-Tate theory. See 
Cassels-Frohlich |CF69], p. 191. 
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4.2 (B) holds for all cyclomic extensions 

We prove the following more general proposition. 

Proposition 28.4.4 (Devissage): °pr-gi-rec-aevissaze If (A) is true for L/K, then (A) holds for 
1. any subextension M/k and 
2. any extension LK'/K". 


For an extension K'(¢,,)/K’, apply the proposition with L = Q(¢,,) and K = Q to obtain 
the following. 


Corollary 28.4.5: (A) holds for all cyclotomic extensions. 


Proof of Proposition|28.4.4 


1. For any place v, ¢yv/K, is the composition of d;»/K, and the projection G(L’/K,) > 
G(M°/K,). Since the global map is the product of the local maps, @ys/« is the com- 
position of dr/x and G(L/K) > G(M/k). Hence ¢y/xK(K*) = 1. 


2. Let L’ = L- kK’. We have a natural inclusion G(L'/K') — G(L/K). The local Artin 
map is compatible with basefield extension with respect to the norm map. Since the 
norm on ideles is computed componentwise, it follows the map ¢ = [],cy, dy is also 
compatible with field extensions. 


OL /K! 


Priel eG Fg 


[smn | 


po GL e. 


Suppose a € K’*. By commutativity and (A) for the extension L/K, we have 


10 by K(@) = or/K[Nmx:/K«(a)] = 1. 


eEk*x 


Since 7 is injective, this implies ¢«//«/(a) = 1. 


4.3 (A) for cyclotomic implies (B) for a split by cyclic cyclotomic 
This follows from the more general proposition: 


Proposition 28.4.6: If L/K is cyclic, then (A) implies (B). 


°Devissage means “unscrewing” in French. 
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Proof. Since L/K is cyclic, we can take y € H'(G,Q/Z) to be a generating character. We 
have the following commutative diagram. 


Kx ee Cie 


a ie is 


H?(G, L*) —> H°(G,I,) ““ > Q/Z. 


The left-hand square commutes by functoriality of cup products; the right-hand square 
commutes by Lemma |28.4.2| Recall e U dy is an isomorphism for G cyclic, by Proposi- 
tion |25]25.12.1| Hence if a € H?(G, L*), then it is equal to bU 6x for some b € K™, and 


inv(a) = inv(bU 6x) = x(¢r/K(6)) = 0. 
In the last step we use (A) to give ¢z/%(b) = 0. 


4.4 (B) for cyclic cyclotomic implies (B) in general 
It suffices to prove the following. 


Theorem 28.4.7: thm:spiit-in-cye-cye For any 3 € H?(K) there exists a cyclic cyclotomic exten- 
sion L/K such that @ maps to 0 in H?(L). m 

There exists a cyclotomic extension K, C K(¢x) with G(K./K) = Z such that the 
inclusion map 


H*(K./K) + H°(K) 


is an isomorphism. 


We first give a criterion for 3 to map to 0 in H?(L), then find a cyclotomic L/K where 
this criterion holds. 


Lemma 28.4.8: lem:criterion-split Let a € H*(K }: Then Resx/1(@) — 0 in H?(L) if and only if 
[L” : K,]inv,(a@) = 0 for every v € Vx. 


Proof. By the Brauer-Hasse-Noether Theorem |28.3.5| Resx/,(a) = 0 in H?(L/K, L*) iff 
Resx/i(a) = 0 in H?(L°/K,, L’*) = 0 for all v. Since invx, is an isomorphism, this is true 
iff invz, Resx,/r»(a) = 0 for all v. But we know 


invx, Resx,/1»(@) = [L” : K,]inve(a), 
from the class formation for LCFT (Theorem |27]27.4. 14). 


Lemma 28.4.9: tem:cye-cyeext Suppose K’/Q is a finite extension and S be a finite set of places 
of kK. There exists a cyclic cyclotomic extension L/K such that 


m | [L” : K,| for every finite v € S 
2 | [L° : K,] for every real v € S. 
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(The second condition is just equivalent to L being complex.) 


Proof. First consider the case K = Q. Note that for an odd prime gq, 


G(Q(Gyr) /Q) & (Z/a")* = Z/(q — 1)q"* = Z/(q — 1) x Z/qh. 


Let L(g") be the subextension of Q(¢,-) with Galois group Z/gq"~*. Becuase Q, only has a 
finite number of roots of unity, v,([Z(q¢") : Q,]) 4 oo as r > o0. 
Similarly for q = 2, 


G(Q(Car)/Q) © (Z/2")* = Z/2 x Z/2"™. 


The subextension Q(g — Gr) corresponds to the automorphisms ¢ +> ¢° with s = 1 
(mod 4), which form a group isomorphic to Z/2"-?. Let L(2") = Q(Gar — Gy") (note this is 
complex), then similarly lim,_,., vo([L(2") : Q)) = oo. Now take 


L:= JI La) 


qi|2m 


for r; large enough. As it is a compositum of cyclic cyclotomic extensions of relatively prime 
degrees, L is cyclic cyclotomic. 

Now suppose we are given general K’. First construct Q(¢,)/Q satisfying the conditions 
for Q with m[K : Q|. Then take L = K(¢,). We have [K"(¢,) : K”] | m for finite primes v 
of Q since [K” : Q,] | [A : Ql). 

We can take K, = Us,, K - yes £(9;')- 


Proof of Theorem We know inv,(a@) = 0 except for a finite number of primes, say 
primes in S. Why? Suppose minv,(a@) = 0. Use Lemma|28.4.9}to get L = K(¢y) such that 
works for m and S. Then by Lemma |28.4.8| Resx/,(a@) = 0 in H?(K(¢y)). 


4.5 (B) implies (A) for all abelian extensions 


This will follow from the following proposition. 
Proposition 28.4.10: If L/K is abelian, then (B) for L/K implies (A) for L/K. 
Proof. Let a € K*. By Lemma|28.4.2} for any character y, 


x(¢z/x(a@)) =inv( @Udx )=0. 
ae 
€H2(L/K,L*) 


Hence $1/x«(a) = 0. 
We have now proved the following. 


Theorem 28.4.11: tnm:as The following hold. 
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(A) For an abelian extension L/K, define the map @,/% as in (28.10). The map ¢;/x takes 
the value 1 on the principal ideles K* C Ix. 


(B) For any a € H?(K/K), 


5 The ideles are a class formation 
sec:ideles-ct We now complete the proof of global class field theory by showing that the ideles are 
a class formation and invoking the theorems in Section In the local case, the G-modules 
in the class formations are the fields themselves, but in the global case, the G-modules are 
the ideles. 
Theorem 28.5.1: thm:global-class-form Let K be a global field. Then 
(G(K/K),{G(L/K) : L/K finite Galois} , Cz) 

is a class formation. 

Note that cane = C, for each L by Proposition |28.1.3 
Proof. We check the axioms in Definition |27.4.5 


Step 1: First, H!(G(L/K), C,) = 0 for every cyclic extension of prime degree (in fact every 


finite extension), by Theorem |28.3.1 


Second, we need maps invy/« : H?(L/K,Cx) =; Q/Z. Right now we just have a map 


INV /K 3 H*(G(L/K), L) => Q/Z. 


We need to show inv;/x “factors through” H?(G(L/K),C_,). We also need to show com- 
patibility with inflation and restriction, and that 


1 


INV /K 4 H?(G(L/K), Cz) - [L : kK} 


Z/Z 


for all L/K. It is hard to show this directly, except in the cyclic case, when we know the 
first inequality holds. As we will see, though, showing the cyclic case is enough, because by 
Theorem every element of H?(G(K/K), Cz) is contained in H?(G(L/K),C,) for 
some cyclic (in fact, also cyclotomic) L/K. 
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Step 2: Consider the following commutative diagram, whose columns are inflation-restriction 
sequences. 


~~ ~~ ~~ 


0 —> H?(L/K, LX) —*5 H?(L/K,l,) “4 H?(L/K, Cz) 


Inf Inf Oi Inf 


~~ 


0 —> H?(M/K, M*) —2-> H2(M/K, uy) 2s UME, Cay 7 b[Z 
Res Res i ee 


0 —> H?(M/L, M*) —®+ H2(M/K, ly) 4 i Q/Z 


inv 


Q/Z. 


The columns are exact by the inflation-restriction exact sequence (Proposition |25]25.11.10) 
and the following: 


1. H'(M/L,M*) =0 by Hilbert’s Theorem 90 (Theorem |26]26.1.1). 
2. H'(M/L,Im) = 0 by Proposition |28.2.4 
3. H'(M/L,Cy) = 0 by Theorem [28.3.1 


The rows are exact because they come from the long exact sequences of 0 > L* — I, > 
C, > 0 and 0 > M* > Iy > Cy — 0, and the fact that H' of C,, Cy, is trivial (again 


by Theorem |28.3.1). 


Step 3: Next we show the maps inv are compatible with inflation. Indeed, since we have a 
class formation for local class field theory (Theorem 27|27.4.14), for every w | v we have the 
diagram 


invK, 1 


H*( Ey) Ky) — > aK] 


[srs | 


inv, 


H? (Ky) ——> Q/Z. 


ZZ 


Now H?(G(L/K), Ik) = ®yev, H?(G", (L”)*) by Proposition |28.2.4]and inv = Dy cy, invy, 
so inv is compatible with inflation. 


Step 4: Thus, we can take the direct limit over MW, noting direct limits preserve exactness, 
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to get (we will explain the dashed and dotted lines) 


eq:3x3-icf 0 0 0 (28.12) 
0 —> H?(L/K, LX) > H?(L/K,1,) “> H2(L/K, Cy) 
Inf Inf ; nf ~ i 
+ : al Invi sS st 
0 ——> H?(K, K*) —* > H?(K, Ig) —“~ H?(K, Cz) 17,/Z 
\ inv : 
Res Res ; Res care : 
=| inva \ : 
0-374) 4/70) yoo Q/Z 
> Se inv a 
inv3 ae i S 
Q/Z. 


Step 5: Now we show the maps inv; are compatible under restriction. Again, since we have 


a class formation for local class field theory (Theorem |27]27.4.14), we have the diagram 


inv, 


—4+Q/Z 
ResKy/Lw [Lw:Kv] 


H?(Lw) 


A? (Ky) 


inv, 


—+Q/Z 


Using H?(G(L/K), Ix) = ®yev, H7(G", (L")*), we can write an element of H?(K,Ix) as 
X = (Ly)yev,e, Where x, € H?(G,,(K,)*). On degree 0, Resx z is the diagonal imbedding 


—— ? <> 1, of Proposition |28.1.3} so on degree 2, 


Ee @ OH (Gu,K’). 


veVE wiv 


Resp X = (Resi, /2 ty) ay) 


vEVK 


The invariant map then sends this to 


S- So invy,, (Resx,/1,, i) = S- S° Nw/v inv K, fy, = 71 be invK, Ly = Ninve,X, 


veEVE wiv veEVE wiv vEVK 


using the fact that [LD : K] = wlv w/v, Where N»/y is the local degree. 


Step 6: By Theorem |28.4.11| the bent maps are complexes, i.e. im(z;) C ker(inv,;) for all 
three rows. 


Thus the maps inv, factor through the images im(p,), for 7 = 1, 2,3 to give the maps inv’. 
Be careful: we have only so far defined inv, : im(p,) + +Z/Z, and not inv, : H?(L/K,C,) > 
1Z/Z. We want to show that for certain extensions L/K, the p; are in fact surjective, so 
the map inv’, is an isomorphism H?(L/K,C,) + +Z/Z. 
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To do this we orders of im(inv,) and |H?(L/K,C_,)|. Again, we use H?(L/K,I,) & 
DucvVx _ L/L (Proposition |28.2.4). Making this identification using the invariant maps inv,, 
the invariant map takes (ay), € ®ryev, iZ/Z tO vce @v- Thus im(inv;) = mm) L/Z 
and 


|im(inv,)| = lem,(n,). 


We have that 
carlemnv leoMy (Ny) = |im(inv,)| = |im(inv,)| < |im(p,)| < |H?(L/K, C)| <n, (28.13) 


where the last step is the second inequality. We don’t get any information out of this unless 
lem,(ny) =n. For certain extensions L/K, we do know it is true, though. 


Step 7: We show that if L/K is cyclic, then lem,(n,) = n. Let S be the set of ramified 
primes and infinite places of kK. By Proposition [28.2.8] G(L/K) is generated by the elements 
Froby/x«(p) for p ¢ S. Now (Frobr«(p)) is sent to a subgroup of index n, in G(L/K). Since 
G(L/K) to be generated by these elements, we must have lem,(n,) = n. 

Then equality holds everywhere in (28.13), we have the exact sequence 


0 H°(L/K, L*) + H2(L/K,1,) + H°(L/K,C,) ¥ —Z/Z— 0, 
n 


where the map H?(L/K,1,) + 4Z/Z is the invariant map. 


Step 8: Taking the direct limit over all L C K, (as defined in Theorem |28.4.7) we get 


0 —> H?(K, KX) —> H?(K,/K,le,) —> H?(K,/K, Cx,) = Q/Z 0 


[is [i [in 


0 ——— H?(K) ———— H?(K, Iz) aad Q/Z 


where the top row is exact. By Theorem ]28.4.7| the left vertical map is an isomorphism. 
The middle map is also an isomorphism because Theorem ]28.2.4| gives that it is the map 


QD W(Ke/K.) > QB A*(K,). 


veEVK vEVK 


This is surjective because H?(K?/K,) = Q/Z via the invariant map, K” being the directed 
union of L,, with [L,, : K,] arbitrarily divisible. Hence it is an isomorphism. Finally, the 
right vertical map is clearly an isomorphism. Thus the bottom row is short exact and invx 
gives an isomorphism H?(K,Cz) > Q/Z, ie. the map inv} in (28.12). Restricting to 


H?(L’'/K,Cz,), it is an isomorphism to wKjL/Z for any L’, as needed. 


We are now ready to reap the rewards of our hard work. 
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Theorem (Global reciprocity, Theorem |24]24.6.1): Given a finite abelian extension L/K, 
there is a unique continuous homomorphism ¢;/x that is compatible with the local Artin 
maps, i.e. the following diagram commutes: 


K — > G(L/K) 


Moreover, @,/x satisfies the following properties. 


1. (Isomorphism) For every finite abelian extension L/K, x defines an isomorphism 


orp: Cx/ Nmzjx(Cr) = Ix/(K* - Nmzx(In)) 3 G(L/K). 


2. (Compatibility over all extensions) For L C M, L, M both finite abelian extensions of 
K, the following commutes: 
G(M/K) 


OM/K 
PL 


Ce 


Thus we can define @x := Lim 1 abelian 2L/K 88 @ map Ix > GK) K), 


3. (Compatibility with norm map) ¢x is a continuous homomorphism Ix > G(K?/K), 
and the following commutes. 


p= 4 GD /5) 
Nmp/K [eto 


x 83 G(K?/K) 


Proof. By Theorem [28.5.1] and the abstract reciprocity law (Theorem |27]27.4.8) we get iso- 
morphisms 5 jj¢ : Cx/Nmy/xK Cr, + G(L/K) satisfying the required compatibility prop- 
erties. We only have to check that $1). = @z/« (recall we defined $,/% as the product of 
local maps). From Theorem for every character x, x(x (a)) = inv«(a U 6x). 
But this is also true for ¢,/% by Proposition Hence $z/« = $x, as needed. 
Uniqueness is clear from the condition that ¢@ restricts to the local Artin maps. 


6 Existence theorem 


We now prove the existence theorem for global class field theory. 
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Proof of Theorem and Theorem This involves explicitly constructing norm 
groups and calculating norm indices, which overlaps with Section The proof is omitted 
for now. See Cassels-Frohlich [CF69], pg. 201-202. 

Theorem |24]24.6.4] now follows from the Existence Theorem and Theorem 


Finally, we prove that ¢x gives a topological isomorphism Ig /K*(Kx,)° > G(K*"/K). 
This finishes the proof of all theorems of global class field theory. 


Proof of Theorem|24124.6.5, First we prove that @x is surjective. We know that @y,/K : 
K 2 G(Hx/K) is surjective, where Hx is the Hilbert class field (See Definition [29]5), since 
this is a finite extensions. Thus it suffices to show dn,/K : lk 4 G(K*/Hx) is surjective. 

We know that for each place v of K, ¢x : Ky — W(k?2>/K,) is surjective (Theo- 
rem [24]24.2.4). Restricting to Uy, we get that ¢x|v, : Uy ~» I(K2>/K,) & I,(K”/K) is sur- 
jective. Since K* (Il,cv, Uv) /K* € Ik, it suffices to show IT, cy, [o(K*?/K) = G(K*”/K). 
Let K2>"" denote the maximal abelian extension of K unramified at v. We have by Theo- 


rem [L515.7.2/ that 


T] m(K*/) = TT GU/Ke") = 6 [27 A — = G(K/Hr) 


vEVK vEVK vEVK 


since Hx is the maximal abelian extension unramified at all places. This shows surjectivity. 
To show the kernel is A*(3)°, note that this is exactly the intersection of all open 
subsets of finite index in Ix. Add details. 
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Applications 


ch:cft-app In this chapter we give several important applications of class field theory to number 
theory, rewarding the reader for reading the difficult proofs in the last few chapters (or 
conversely, motivating the reader to read the proofs). 

Why is class field theory useful? It relates a field K to its Galois group G(K?>/K), so 
transfers information about the extensions of a field into information contained in the field 
itself, or conversely, relates the behavior of elements in the field K, to their behavior in 
various extension fields. Moreover, because the global Artin map is constructed from the 
local Artin maps, questions in number theory involving global fields like Q can be understood 
by patching together information from its completions (local fields). In the chapter, we will 
use the full power of class field theory to give solutions to the following problems. 

Throughout, we will assume that K is a number field. 


1. 


Reciprocity laws: We show, roughly, that whether a prime p is a perfect nth power 
modulo q, depends only on q mod p (actually, some multiple of p). Reciprocity hence 
shows that the Legendre symbol (2), is like a group homomorphism in both the top 
and bottom. The Artin isomorphism will give us the homomorphism in the bottom. 


. Local-to-global principle: We show the Hasse-Minkowski theorem: a quadratic 


form has a solution in K iff it has a solution in every completion of Kk. 


. Density of primes: We prove the Chebotarev density theorem on the distribution 


of prime ideals in a number field. 


. Splitting of primes: We show how a prime p splits in an abelian extension L/K 


depends only on p modulo a ray class group, since splitting behavior can be expressed 
in terms of the Artin map (Proposition [2424.1.3). We show this characterization 
is unique to abelian extensions, and give some examples for splitting in nonabelian 
extensions. 


. Maximal unramified abelian extension: We characterize the maximal unramified 


abelian extension Hx of a number field AK, and show that all ideals of K become 
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principal in Hx. Hx can be computed for quadratic extensions using the modular 
function j, which we show in Chapter [41] 


6. Primes representated by quadratic forms: We relate quadratic forms to primes 
using the Gauss correspondence (Theorem |17/17.5.1), then use the Hilbert class field 
to characterize which primes are represented by a given quadratic form. 


7. Artin and Hecke [-functions: We use class field theory to show that for abelian 
extensions, all Artin L-functions are Hecke L-functions. This is useful because it is 
relatively easy to show Hecke L-functions satisfy nice properties such as analytic con- 
tinuation and functional equation. This was Emil Artin’s original motivation for class 
field theory. 


Finally, we describle how class field theory fits as the “1-dimensional case” of the Langlands 
program. 


1 Reciprocity laws 


secrreclaws First we interpret and generalize the Legendre symbol using class field theory. We 
derive a generalize reciprocity law using class field theory, and then specialize to quadratic, 
cubic, and biquadratic reciprocity. 

Reciprocity laws take two forms. The first is as follows. 


Theorem 29.1.1 (Weak reciprocity): tim:wear-ree Let A be a number field containing all nth 
roots of unity. Let p be a fixed prime. Then there exists a modulus m and a finite subset 
S € I? /Pxi(m), such that for all p relatively prime to m, 


p is a perfect nth power mod q <> (q mod Pxi(m)) € S. 


In fact, S is the kernel of a certain homomorphism J (m)/Px1(m) > Ln. 


This tells us that whether p is a perfect nth power modulo qg, depends only on the modular 
properties of g, and is moreover characterized by a group homomorphism. However, it does 
not give an efficient method to actually determine whether p is a perfect nth power modulo 
q. To get this we turn to strong reciprocity. 

We know that the Legendre symbol (2) (and its generalizations to nth powers, (*) ), 
is a homomorphism in the upper component as well, so it is natural to relate these two 


ef 
homomorphisms: what is their ratio (2) (2) ? This will give us a natural algorithm to 


compute the Legendre symbol (*) . We will prove strong reciprocity at the end of this 
section, after we discuss the Hilbert symbol. 
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1.1 Weak reciprocity and the Legendre symbol 


The key observations linking reciprocity to the Artin map are that a is a perfect nth power 
Np—1 


modulo p iff a» = 1 (mod p) (just like (2) = a’> in the quadratic case), and the 


homomorphism a ++ a = can be linked to the Frobenius map. 


Definition 29.1.2: Let K be a number field containing an nth root of unity, and let p be 
a prime ideal with an L Ntp. Define the Legendre symbol (2) to be the unique nth root 
of unity ¢ such that 
MNp—1 
¢=a~ (mod p). 


To see this is well-defined, note the following two points. 
Np-1 


n 


1. The nth roots of unity are distinct modulo p because n L tp. Hence is an integer. 


2. Cam )" = 1=1 (mod p) by Fermat’s little theorem so a is equivalent to a unique 


nth root of unity. 


Proposition 29.1.3: Let K be a number field containing an nth root of unity, let p be a 
prime ideal with an L Mp. Then a is a perfect nth power modulo p iff (2) = 1. 


Proof. Let the residue field of p be k. As k* has order Stp—1 and is generated by 1 element, 


a is a perfect nth power modulo p iff ——— (2) =1 


Proposition 29.1.4: (2) is a group homomorphism factoring through (x /p. 


Proof. Clear. 


How can class field theory give us an expression like this? Well, the Frobenius element 

Np-1 
corresponding to p acts like taking the Ytp power modulo p. How do we get to a =e By 
acting by the Frobenius on ¢/a instead. 


Proposition 29.1.5: price-cr: The following holds: 


or vane (90) 


where L = K(%/a). 


Proof. First note p { an implies that K(%/a)/K is unramified at p, by Theorem |21/21.2.5 
By definition w/x(p) is the homomorphism that sends b to 6” modulo p. Thus 
Np-1 


bey (p)|(X/a) = Va" =a = Ya (mod p). 
But <a satisfies X" — a = 0, so (p, L/K) must send ~/a to ¢ %/a where ¢ is some root of 


unity. The above equation shows that we must have ¢ = ; , as needed. 
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b 


defined for primes p. Equivalently (by Proposition 29.1.5), define (¢) = uO Ya) 1 
We can now prove weak reciprocity. 


Proof of Theorem|29.1.1, By Proposition |29.1.5} 


eae en _ WK wats Ya) 


We define Gas for any b € i¢ ae ) by extending = the map (¢), originally 
(b) 


(29.1) 


Taking a = p and p = (q), we get 


(2) _ Wace (DIY?) 
d/n VP 
Let m be the conductor of K(x/p)/K. Since «( yp)/« is an homomorphism on J7/i(Pri(m)) 


(Theorem |24]24.4.1), its kernel contains i(Px atm a In other words, when g € i(Px1(m)), 
K(YD/K 


then (2) = Wa aoe UN en = 1 and pis a perfect nth power modulo q. 


1.2 Strong reciprocity and the Hilbert symbol 


To prove strong reciprocity we need to actually compute (29.1). Supposing p is a principal 
ideal (b), our statement about reciprocity seems to suggest that b and a play similar roles in 


the equation:? 
=) — [erp (0)|(/e) 
bia °/a, 
However, is not symmetric. We seek to symmetrize it. 

But look at Proposition |26]26.2.2| Equation is the character corresponding to the 
element a € K*. Using the map in Kummer Theory, we can get the equation symmetric in 
a and b. In fact, we did this already when we defined the Hilbert symbol. 

If motivation was lacking when we defined the Hilbert symbol, hopefully this clears things 
up: it explains and clarify the duality in a and 6b observed above by making it symmetric in 
a and b. 


(29.2) 


eq:leg-symb-cft2 ( 


Proposition 29.1.6: pr:nitbertisree Let 6 { n be prime in K and Ky the completion at b. Let 
(0: AX /K xX” x KX / KX” pn denote the Hilbert symbol. Then for a L b, 


(2, bn = (F) 


TWe can extend the definition to all prime elements p by definining (2) = 
n 


éi/K (tv (p))( Ya) : 
Ya , then extend 


the definition of (¢} to encompass any b € K™ by multiplicativity. For instance, in the case n = 2, this 
b) n y 


gives the Jacobi symbol. For b = 2, (4) tells us whether a is a perfect square modulo any power of 2. 


?Caution: we’re using the Artin map on ideals; we write Wr/K(b) to mean ~p/xK((b)). In contrast, 
ér/K(b) =1 since bE K. 
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In general, if K,(%/a)/K,(a) is unramified, 


a (Crs) 
a, 9) an = : 


ae 


where (a, b),,, denotes (a, b),, when a,b are considered in Ky. 


Proof. Proposition |29.1.5| and Proposition |27]27.6.3] give 
(2) _ We ONY) _ Wins) _ oy 
b Re Va va »Y¥) bn 
For the second part write a = mu and b = r*u’ where u,u’ are units, and use bilinear- 


ity |27|27.6.4] to compute 
(ru, a*u’) = (1, n*u’)) (u, r)* (u,u’) = 1 since K(%/a) unramified, |27]27.6.5 


, k 
= (x, —m)** (a, (—1)*u')3 (<) by the first part 
k 
= ((—1)*u’, x) (=) (x,—7m) =1, Theorem |27]27.6.4(2) 
T7 n 


The last main ingredient is the product formula for Hilbert symbols. 


Theorem 29.1.7 (Product formula for Hilbert symbols): tim:nitert-proa Let AK be a number 
field containing the nth roots of unity. Then 

II (4b) =1. 

vEVK 
Proof. Using the fact that the global Artin map can be written as the product of local Artin 
maps, 


IT dx.cvayn. (0) = ox(b) = 1, 


VvEVK 


because $x is the identity on K. Now operate on this by the character x(o) = ove) Ek 


and use Proposition |27]27.6.3]to get 
II (4. = IL x(x yayx(0)) = 1. 


vEVK vEVK 
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Combining Proposition |29.1.6] and |29.1.7/ gives the strong reciprocity law. 


Theorem 29.1.8 (Strong reciprocity): tnmstroneree Let AK be a number field containing a 
primitive nth root of unity and suppose a, b,n are pairwise relatively prime. Then 


(5), = IL ben 


v|noo 


Proof. Suppose a, b,n are pairwise relatively prime. For a number c let S(c) denote the finite 


places v where u(c) 4 0. We calculate (2), and (2). using multiplicativity. We have 


3).(3). = (rae) (Tones) | 
Ee . = o=( Te), @=( me 
(5), (|; n resi) al?) Tres(a) gist) mES(b) TES (a) 


a\ ?r() b\ 7x (a) 
OO 
JL G), vnes(a) \"/n 
G\ vx(b) b-ex(@) 
“HO (=) 
AL) ve €S(a) ac 
= ][ (ao). Jf @bp by Proposition [29.1.6] 
veES(b) vES(a) 
= [I (ab). (a, b)c = 1, (a,b), = 1 when a,b € Ui, 
vince 
= Il (0, a)y 
v|noo 


where in the last step we used the product formula |29.1.7} which tells us [],cy, (a, b)y = 1. 


Now suppose a is a prime dividing n. Then again using multiplicativity, Proposi- 


tion [29.1.6 and the fact that (a,b), = 1 for v | noo, nfa (Corollary [27]27.6.5), 
Un (b) 
ch ~ I] (=) - Il (a,b). = Il enare 


vES(b) v|noo 


In practice, we can compute the action of the Hilbert symbol for each v | noo, since 
Kx /KX" is a finite set. We will carry out these computations in the cases n = 2,4, for 


K = Qand Q(t). 
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1.3 Quadratic and biquadratic reciprocity 
We derive quadratic and biquadratic reciprocity using Theorem ]29.1.8 


Theorem 29.1.9 (Quadratic reciprocity): Let p,q be odd primes. Then 


=cwr, Q=ent. @(s)<cor 


a 


Proof. The first follows from definition of the Legendre symbol. By strong reciprocity |29.1.8| 


@ = (2,p)a 


Let U denote 1 + (2)' in Qo. 


1. We have (2,p)2 = 1 iff p is a norm from Q,(V/2) (Theorem 27]27.6.4), iff p is in the 


form x? — 2y? in Qy. Looking at this modulo 8, we must have p € {1,5}27. This is 
sufficient as we know [Q> : Nmg,(/2)/0,(Qe(v2)”)] = [Q.(V2) : Q] = 2, so we must 

= 
have Nmg,(y2)/0, (Qe(V2)”) = {1,5}2%. Hence (2,p)2 = 1iffp = 1,5 (mod 8), iff "> : 
is even. This gives 


. We have (p, q)2 = 1 iff ¢ € N := Nima, yp)/Q.(Qa(/P)*), iff q is in the form x? — py’. 


(a) If p = 1 (mod 4), then x? — py? can attain any odd residue modulo 8. Since 
IQ: N] = [(Q(VP) : Q] < 2, we have U8)2?4 — Q¥? CN. Since N contains all 
residues modulo 8, U2?4 C N. Hence q € N, and (p,q)2 = 1. 


(b) If p = 3 (mod 4), then 2? — py? cannot be 3 (mod 4). Hence N = U®)22, and 
q € N iff g=1 (mod 4). Hence (p, q)2 = 1 iff g=1 (mod 4). 


It remains to note lems = 1 iff either p= 1 (mod 4) or g=1 (mod 4). 


Theorem 29.1.10 (Biquadratic reciprocity): Suppose p,q are primes in Z/i] with p,q =1 
(mod (1+ 7)?). Then 


(2) -corene(2), 


Note every prime contains an associate that is equivalent to 1 (mod 4). 
Proof. Note p= 1 (mod (1 +7%)*) means p= 1 or 1+ 2% (mod (1 +7)°). 
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By strong reciprocity |29.1.8} 


(5), (0), tame 


We have (p, q)2,4 = 1 iff ¢ © Nma,(/p)/a,(Qe(\/p)*). Consider 2 cases. 


1. Np =1 (mod 8). Equivalently (writing out p = a+bi and calculating the norm), p = 1 
(mod 8). We can calculate that (1 + i)37U® CN := Nmgyyp,)/Qa(i) (Qa(VP.4)*), 80 
q € N. (The calculations are lengthy, but here’s the idea: by examining the structure 


of Q»(i), or using Proposition |28}28.3.2| we find that Qo(i)*4 = U™(1 + i)". Hence 


the norm group N satisfies 
UM%1+i)" CN C Q(i)* 


and has index at most 4. Now calculate the norm of enough numbers in Q2(,/p, 7) until 
we can determine (1 + 1)32U°) C N. Using a computer algebra system is advised.) 


2. Mp =5 (mod 8). Equivalently, p= 5 (mod 8). We can calculate that (1 + 1)42U°) C 


Nmg,(5)/Q2(Qa(./p)*) but (1 + 2%)(1 + i)”4U® Z Nma,(jp/o,(Qa(VP)”). Hence 
(p,q)4 = 1 iff ¢=1 (mod 4), ie. iff %q =1 (mod 8). 


In the case where Mp, Nq = 5 (mod 8), we have (p,q); = (p,q?) = 1 but (p,q)4 # 1 so 
(p, q)4 = ad, 


1.4 Reciprocity for odd primes 


We give an algorithm for finding reciprocity laws for Q(¢,) /Q for p prime, and then specialize 
to p= 3. 


Theorem 29.1.11: timexpiicit-ree Let p be an odd prime, let AK = Q(G,), and let v be the 
valuation corresponding to 1 — ¢,. Let 7 = 1 —(¢,. Then the elements 


1 
yH=l=c=G 
nN =1-—7° 


tp = 1l— ax? 


generate KX /Kx?, and (a, b), is the unique skew-symmetric pairing Kx x KS > uy satisfying 
the following. 


Le se = (ites eT a - 
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i, iS¢<e=1 
¢, t=p. 


2. (ni, T)y = | 


Moreover, if i+j >p+1, then (a,b), = 1 for alla € U™ and b € U), 
We start with the following lemma. 


Lemma 29.1.12: Let K bea number field containing pth roots of unity. Let ¢ be a primitive 
pth root of unity, 7 = 1—¢, and p a prime dividing 7. Suppose a = 1+ 7?c with r =1—¢ 
and c € @,. Then for all 8, 

(a, b)» = C— tp (rp (0) 


We will just need the case where K = Q(G,), in which case k = F,. 


Proof. Because a ¢ p, K(¥/a)/K is unramified by Lemma |20]21.2.5| We have (cf. Proposi- 
tion [I9f19.2.2) 


‘on 
<0 
(l—m)P—1_ 
a emer 


=> ql _ pg? 24... t+ n= 0 
— 7? +=-p (mod pr) 
and we get 


qP-1 


eq:explicit-recl = —l (mod p). (29.3) 
Pp 


Let a = ¢/a be a pth root of a, and write a = 1+ 72, where x € L. Now a™ — 
a = 0 becomes (1 + mx)? — (1+ a?c) = 0. Hence z is a zero of the polynomial f(X) = 
+ ((1 + 7x)? — (1+ 7%c)). Using (29.4), we find that f(X) is integral, so 7 € @;,, and that 


modulo 7, 


i] 
F(X) = — (nPa? + pra —1?c) = a?-—x—ec (mod m) 
1 
Let Ntp = p!. Letting o be the Frobenius, we find that o(x) =x?’ (mod p). Note that 


J y= at i 
xP =(a@+c)P  =aP +c 


(mod p). 
Hence by induction 
ecrexplicit-reci (7) = Pl = 7 + Cc + ou + ace + oe" = 7 + tlk/Fp (c) (29.4) 


in k. Now by Proposition |27}27.6.3} 


(a,b) = Sxstoyine) a) _ sel 
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To get the second equality, note that by construction, $x, (a)/K,(7) is the Frobenius element; 


as K,(a)/K, is unramified, Ux, C ker $x,(0)/K, (Example 27|27.5.1), and the Artin map 
depends only on v(b). We have 


(a,b), = C” where C"a = 0° )(a); 
to find n we reduce both sides modulo pz. We calculate 


Ca = (1—a7)(l+ar)=1+(x%-—1)n (mod pz) 
=> C"a — 1 + (ae — n)T (mod P77 Jearexpticit-reca 
oO(r) = a+ v(b)trzyr,(€) (mod 7) by 
=> og (®) (a) = 1 ++ (a ++ u(b) trie, (c))a (mod P77) -eqeexplicnvecs 


Matching (29.6) and (29.8) gives n = —v(b)trgp,(€) and 


(a,b) = CO" (mod 7). 


In particular, note that (a,b), = 1if a= 1 (mod 7?*t!). By nondegeneracy of the pairing 


(Theorem |27]27.6.4), we get that a € (KX)?. Hence U®+) C (Kx)?. 


Proof of Theorem|29.1.11| Note that n; generates U /U*, and m generates KX /(KX)PU, 
As mentioned above, U'?*+) C (Kx)? so 7,m,---;Mp generate Kx /(K*)?. Since the group 
has order 7 = p?t! (Proposition |28]28.3.2), these generators are independent. 


We use a relation between the 7;,7; to derive the first relation. Namely, we have ma + 


at+7 
(4, ni) Sy 
M+j Mita / p 


by Theorem |27.6.4| Note (a,—1) = 1 for any a because —1 is a pth power. Expanding the 
above bilinearity gives 


ri = 1, so 
Nitj 


1 = (nj, 7m) (ij, 7m)” (iggy, —M-43) (M43, —D) (ny, M47)” 
p——— SS SS 


1 1 
= (nim) (nj.7?) (mtg, 7)" (nisg, me) (ng, M43) 
=1,nj+7I=1 


= (nin) (nits, 7)? (Mig, Mm) (Migs 1) 


= (m5) = (Mis M5) (Mita My) (Mtg): 
This shows item 1. For item 2, note for 1 <i < p—1 that since n, + 7’ = 1, 
1 = (m, 7") = (ni) => 1= (mi, 7). 
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For i = p, we use the lemma to find 


Cres oe = C~ tk ep (1) = ¢ 


because k = F,. 

Note that ifi+j7 >p+1, then n4; € U®t) C (Kx)? so item 1 gives that (;,7;) = 1. 
Now as a skew-symmetric bilinear pairing (7;,1;) is determined by items 1 and 2, because we 
can expand (7;,17;) using item 1, then repeatedly expand factors (the indices increase each 
time) until we only have factors in the form (e,7), and use item 2 to get a value out. 


We now use this to derive cubic reciprocity. 


Theorem 29.1.13 (Cubic reciprocity): tnm:cubicree Let K = Q(w), where w = (3 = as 
For a = +1 (mod 3@x), write 


a= +(1+3(m+nw)). 


Then 
(2) = (), ifb La,b=+1 (mod 36x) 
(3) arm 
a’3 
fa as 
Qs 
a 3 


Note that if g # 1 (mod 3) is prime, then 3 f |IF*| so any element of Ff is a cubic residue. 
Note any element of K relatively prime to 3 can be written in the from w’(1 — w)/a where 
a = +1 (mod 3@x). 


Proof. First suppose a,b = 1 (mod 3). By Strong Reciprocity |29.1.8} 


(5), (3), =e 


3 


Note a,b € U® so by Theorem |29.1.11} (b,a)3 = 1. This shows the first equation. 
For the second, letting 7 = 1 — w, note that 


(1 — 17)*(1 — 2°)? = (1 + 8w)9(1 + 3(2w +:1))* € [1 4+. 3(8 + (28 + a)w)U 
Setting a =n —2m and 6 =m, we get 
cedar laa "Ue 


(1 = cy _ on be E aU) 
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Now Theorem |29.1.11] tells us 


(w, 1 — 7?) = (m,m) = (3,7)? = w 
(w,1—m*) = (m,n3) = 1 

(x,1—77) =(m,7) 1 =1 

(x, 1 —7°) = (3,7) = wt. 


Thus 


As an application, we show the following. 


Theorem 29.1.14: tnm:2uic If ¢g = 1 (mod 3) is a prime, then 2 is a cubic residue modulo q 
iff g is in the form 
q=274+27y’, for some x,y € Z. 


Proof. Since q = 1 (mod 3), q splits in @x as aa. By multiplying by a root of unity, we 
may assume a = | (mod 3@x), ie. a is in the form a = 3(4 + yw) + 1. In order for 2 to 
be a cubic residue, it must be a cubic residue modulo a. If a? = 2 (mod a), then a = 2 
(mod @), so it would also be a cubic residue modulo @ and hence modulo q. 

Now (2) = Litt (<) = 1, by Cubic pees Since 2 remains inert in Ox, and 


the only cube in Ff is 1, we get that a@ must actually be in the form 


a = 6(4 + yw) +1. 


Taking the norm gives 
p = (6x + 3y £1)? + 27y’. 


This is in the form x’? +27y"; conversely, any prime in the form 2’?+27y” must have x’ = +1 
(mod 3), and hence is in the above form. 


2 Hasse-Minkowski Theorem 


The global Artin map can be expressed as the product of local Artin maps. From class 
field theory, we get various “local-to-global” results such as the Hasse-Brauer-Noether The- 
orem [28]28.3.5]and the Hasse Norm Theorem |29.2.2]| The most famous is the local-to-global 
principle for quadratic forms, the Hasse-Minkowski Theorem. 


Definition 29.2.1: A quadratic form is said to represent a if there is a solution to 


q(X1,..-,Xn) =a with (21,...,2%,) # (0,...,0). A quadratic form representing 0 is said to 
be isotropic. 
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(For a review of quadratic forms, see Chapter [i7}) 

Where class field theory comes in is that a quadratic form in 2 variables representing 
a number a can be interpreted as a norm equation, a = x? + by”. We can write this as 
a = (a + yvb) (2 — yb) = Nm, (Vby/K (E+ yvb) when Vb ¢ K. Class field theory gives us 
a local-to-global theorem for norms, the Hasse Norm Theorem. This will prove the n = 2 
case of Hasse-Minkowski. Then a series of elaborate reductions will prove the local-to-global 
principal for any number of variables. 


2.1 Hasse norm theorem 


Theorem 29.2.2 (Hasse norm theorem): tnm:hasse-norm Suppose L/K is cyclic. Then a is a 
global norm iff it is a local norm everywhere: a € Nmy x L* iff a © Nmpo/x, L’™ for all 
v € Vr. 


Compare this to the proof of Theorem |28}28.3.5 


Proof. The forward direction is clear. 
Let G = G(L/K). Take the long exact sequence in Tate cohomology of 


(Si =I, 0,50 


to get the top row of the following. 


eq:hasse-norm He (G, C,) ) ——_> Any G, i) ) ——____ AX(G G,I,) irr (29.9) 


0 ———_> K*/ Nmyz/x L* —> @ycy, KX /Nmx,(L*) 


We explain the bottom row. First note the equalities of H? are by definition of H?, plus 
Proposition |28]28.2.4| Next note cohomology is 2-periodic because G is cyclic (Proposi- 
tion |25]25.12.1), and H+(G,C,) = 0 by Theorem |28}28.3.1] (HT90 for ideles), so 

He (GC; )= HAG, Cz) =0. 


Then (29.9) gives that the map K*/Nmz/x L* 9 @, cy, KX /Nmx,(L°”) is injective. If 
a € K* is a norm in every completion, then it is 0 in @,cy, AX /Nmx,(L°*), hence 0 in 
K*/Nmz/x L*, hence a global norm. 


2.2 Quadratic forms 


We prove the following. 


Theorem 29.2.3 (Hasse-Minkowski): nasse-minkowsii Let AY be a number field. The following 
hold. 
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1. A quadratic form f defined over K represents a iff f represents a in every completion 
Ky. 


2. Two quadratic forms over K are equivalent iff they are equivalent over every completion 
Kop 


First we note that item 1 implies item 2. 


Proof that 1 implies 2. The forward direction is clear. For the reverse direction, induct on 
the rank n, n = 0 being the base case. Suppose f,g are equivalent over every completion 
K,. Suppose f represents a. Then f represents a over every K,. Since g ~ f over every K,, 
g represents a over every K,. By item 1, g represents a. 

Thus we can write f ~ aX? + f’, g~ aX?4+ 4’. Now aX? + f' ~ aX? +4! over every 
K, implies (see Serre IV.1.7, Prop. 4]) f’ ~ g’ over every K,. By the induction 
hypothesis, f’ ~ g' over K. Thus f ~ g. 


Next we show that we can reduce item 1 to a statement about quadratic forms repre- 
senting 0. 


Lemma 29.2.4: Suppose char(’) 4 2. An nondegenerate isotropic quadratic form over kK 
represents all of K. 


Proof. Let B be the bilinear form associated to g. Suppose x # 0 is such that q(x) = 0. Since 
q is nondegenerate, there exists y such that B(x, y) 4 0. Then q(x+ay) = a2q(y)+2aB(x, y) 
attains every value as a ranges over K. 


Lemma 29.2.5: A quadratic form q(Xj,...,Xn—1) represents a iff q(Xq,...,Xn-1) — aX? 
represents 0. 


Proof. For the forward direction, suppose q(#1,...,%m—1) =a. Then q(%1,...,2%n—1)—a-1? = 
0. 

For the reverse direction, let (x1,...,2Z,,) be asolution. If x, = 0 then q(x,...,%n) =0so 
q represents 0. Thus q is isotropic and represents a. If x, 4 0 then q Ge Sas ae =a. 


Thus it suffices to prove item 1 of Hasse-Minkowski for a = 0. Specifically, item 1 for 
forms with n variables is a consequence of item 1 for a = 0 for forms with n+ 1 variables. 
We now prove Hasse-Minkowski. Every quadratic form over a field not of characteristic 2 
can be put in diagonal form, so it suffices to consider diagonal forms. By scaling, we may 
assume one of the coefficients is 1. 


2.2.1 Proof for n < 2 


For n = 1 the theorem is trivial. For n = 2, we need the following. 


Lemma 29.2.6: An element a € K is a square iff it is a square in every completion Ky. 
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Proof. (cf. the proof of Proposition |28]28.2.8) The forward direction is clear. 
So suppose a is a square in every completion. Then K,(./a) = K so Nmx,yayx, Kv(Va)* = 


KX. This shows Nmx yay/x (Ix(ya)) = Ix. By the first inequality [28]28.2.1] 
[K(Va) : K] < [Ix : Nmxyyayx(Ix(yay)] = 1 


so K(,/a) = K, i.e. a is a square in KX. 


Now a quadratic form 


represents 0 iff a is a square (it rearranges to (2) = a), so q represents 0 over K if it 
represents 0) over every K,. 


2.2.2 Proof for n=3 
As promised, we re-express the condition for p(a) to represent 0 as a condition on norms. 
Lemma 29.2.7: tem:3var-norm Let K be any field. A quadratic form 
g(X,Y,Z) = X* — bY" — cZ? 

represents 0 iff c € Nmxye/x(K(v.)*). 
Proof. Note if q(x, y,z) = 0 with z = 0, then b must be a perfect square. If b is a perfect 
square then K(\/b)/K is trivial and c is trivially a norm. 

So it suffices to consider solutions with z 4 0 and 6 not a perfect square. In this case, 

x? — by? — cz? =0 


iff 


c= (2) -0(¥)"= (E-v6- 4) (E+ v6.2) = simayoye (=v) 


By the Hasse Norm Theorem ]29.2.2) c € Nmxye/x(K(V8)”) if this is true for every 
completion K,. Combined with the Iemma above, this gives Hasse-Minkowski for n = 3. 
We will need the following in the proof for n > 5. 


Lemma 29.2.8: 3-ataimost The form f = X? — bY? — cZ? represents 0 in a local field K, iff 
(b,c)y = 1. Moreover, f represents 0 in K, for all but a finite number of places v. 


Proof. Note f represents 0 iff c € Nx yp (K(Vb)*), which is equivalent to (b,c), = 1 by 
Theorem |27]27.6.4| Only finitely many of these are not equal to 1 by Corollary |27]27.6.5 
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2.2.3. Proof for n =4 


We reduce the n = 4 case to the n = 3 case (but for a different field extension) by the following 
string of equivalences. The brilliant idea here is to turn the quadratic form equation into a 
quotient of norms. 


Theorem 29.2.9: ims For any field K, the following are equivalent, for a,b,c € K*. 
1. The form f(X,Y, Z,T) = X? — bY? — cZ? + acT? represents 0 in K. 
2. cis a product of norms from K(./a) and K(vV0): 


c€ Nmgiyay(K(Va)*) Ning yp (K(VB)”). 


3. cE NmgyayayK (vas) K (Va, vb)*). 
4. The form g(X,Y,Z) = X? — bY? — cZ? represents 0 in K(Vab). 


Proof. (1) <> (2): If (x,y,z, t) is a solution with 2? — at? = 0, then 2? — by? = 0 as 
well. Then a,b are squares in K and (2) is clear. So it suffices to consider solutions with 
z* — at? #0. In that case, 


x? — by? 
2? — at? 


= (x — Vby)(x + Vby)(z — Vat)“\(z + vat), 


x? — by® —cz*+act? =0 => c= 
and this has a solution iff (2) holds. 


(4) <= (3): Applying Lemma |29.2.7| we see (4) is equivalent to c being a norm from 


K (Vb, Vab)/K (ab). But K(vVb, Vab) = K(/a, V0). 


(2) <= (3): This is the hard part. We consider the field extensions 


= K (va, vb) 


L? = K(Va) L°? = K(Vab) LT = K(v) 
ee al 


If any of a,b, ab is in K*? then the result is clear: If a € K*? then both (2) and (3) are true 
for any c, since K(./a) = K and K(./a, Vb) = K(Vab). If ab € K*? then K(./a) = K(vb) 
so both (2) and (3) are equivalent to ¢c € Nmg (yay (K(Va)*). 


Now assume a,b, ab ¢ K*?. In this case G(K(Vab)/K) © Z/2 x Z/2 with the 3 subex- 
tensions corresponding to 3 subgroups. Let a be the non-identity element fixing K(,/a), 7 fix 


398 


Number Theory, §29.2 


K(¥b) and p = or fix K(Vab). (Le. o switches +Vb and 7 switches +\/a.) We convert the 
statements in (2) and (3) into the language of Galois theory, using the fixed field theorem. 
Note (2) is equivalent to the following: 


(2)’: There exist z,yeEL, o(4)=2,7(y) =y, zp(x)yp(y) =e. 
To go between these statements take 
a! =z— Vat, y=x— Voy 
and note p conjugates both fa and Vb. Similarly, (3) is equivalent to the following. 
(3): There exists zE€L, zp(z)=cG 


just take 2’ = (2 — Vby)(z — /az). To go from (2)! to (3)! just take z = xy. To go back 
from (3)! to (2)’ requires more work. Given z, let u = aol) Now o(u) = wu and up(u) = 
zee) — 1, Since o(u) = u. ie. u € K(Va), and G(K(Va)/K) = (1, 7] kya}, by 
Hilbert’s Theorem 90 (26]26.1.1) there exists x € K(/a) (ie. x satisfying o(x) = x) such 


that rte) =u. Set y = &. We’ve chosen x satisfying the conditions. For y, note 


z 
x 


Ty) = 7(z) Tp=c7 
_ o(z) T(x) = 
2 8: _ zo(z) 
=A oy 2) =e 


Finally, zyp(xy) = p(z)p(p(z)) = c¢. This shows (2)’ => (3)! and finishes the proof. 


Now we show Hasse-Minkowski holds for n = 4. By (1) <= (4) in Theorem ]29.2.9} 
Hasse-Minkowski for f = X? — bY? — cZ* + acT? over K is equivalent to Hasse-Minkowski 
for g = X? — bY? — cZ? over K(Vab), and we have already proved Hasse-Minkowski for 
= os 


2.2.4 Proof for n> 5 


We now prove Hasse-Minkowski for n > 5. We proceed by induction. The idea is to “replace” 
aX? + bX? by just cX?. 
Suppose it proved for n — 1, and write 


f (Kayes, Xn) = OXF + OXF = g(Xy,..., Xn). 
Suppose f represents 0 in each K,. Then there exists c, such that 


OX? + OXe = 6. = OX 5,005, Xn) 
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has a nontrivial solution in K,. By Lemma there exists a finite set S such that g 
represents all elements of K, when v ¢ S. We only need to focus on v € J. 

Note Kx? is open in KX by Theorem By the Weak Approximation Theo- 
rem there exists c such that c € c,K*? for all v € S. Since c, is in the form 
ax} + brs, so is c. Then c = g(X3,..., X,) has a solution for all v. 

Thus 

h(X, X3,...,Xn) = eX? — g(X3,...,Xn) 


represents 0 in all K,. By the induction hypothesis, it represents 0 in K as well. Then f rep- 
resents 0: if c = ax? + bx? then replace the solution (x, 73,...,%n) with (7x1, 222, 13,..., 2p). 
This finishes the proof. 

We now use Hasse- Minkowski show that most quadratic forms in n > 5 variables represent 
0. 


Lemma 29.2.10: totsrep-o A form f = X? — bX? — cZ? + acT? represents every nonzero 
element over a local field K unless K = R and f is positive definite. 
A form f in n > 5 variables over K represents 0 unless K is real and f is definite. 


Proof. First we show that if f does not represent 0 in K, then a,b ¢ K*?, ab € K*?, andc Z 
Ning (ya) K(/a)* = Ng yy) K(¥Vb)*. Ifa or bis in K*? then f clearly represents 0, so a,b ¢ 
Kk*?, By ~ (1) => ~ (2) of Theorem|29.2.9| ¢ ¢ Nm, jay/x(K(V/a)”) Nmxyp/x(K(V)”). 
If K(./a) 4 K(Vb), then the norm groups are distinct groups of index 2 in K*, by the cor- 


respondence between norm groups and extensions. Then their product must be all of K%, 
a contradiction. Hence, K(/a) = K(Vb) and ab € K*?. Then ~ (2) becomes simply 


e ¢ NmgiyayK(K(Va)*). 
Conversely, suppose a,b ¢ K*?, ab € K*?, andc ¢ Nmg (vq) K(/a)* = Ning) K(Vb)*. 
Let N := Nmgiyay/«(K(/a)*); as noted it has index 2 in K*. Then 
ig — by? — cz” +. act? : 2, y,z,t € K not all zero} = {x — by’) — c{z? —at?} = N-—cN 


where A+ B denotes {atb:a€¢A,be B}. Sincec ¢ N,0 ¢ N—cN. Since N —cN is 
invariant under multiplication by elements of N, it is a union of cosets of N. Suppose that 
N—cN # kK. Then N —CcN is either N or cN, and 


{N —cN,cN — 2° N} = {N,cN} 


SO 
N-cN+cN-—CN=N4cN. 


If —1 € N, then N+cN = N —CN is cN or N, which is a contradiction because 0 is in the 
LHS above. Hence —1 ¢ N. Then 


(N —cN) —(cN —c?N) € {N —cN,cN — N} = {N,cN} 
Now c,—1 ¢ N imply —c € N, so (N—cN) —(cN-C?N) =N+N+N+4+N€{N,cN}. 
Wehaye FP 4trP era]? eN and 344 =o, co NNN EN = and 
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N+N = N (as it is a union of cosets). This implies that there exists a choice of sign in 
K: K™* is the disjoint union of the closed semigroups N of “positive” elements and —N of 
“negative” elements. If K is p-adic then this cannot happen as we must have N D Z = K 
where @ denotes closure in the v-adic topology. The only possibility is K = R. Because N 
consists just of positive numbers, f is positive definite. This proves the first part. 

For the second part, write f(Xj,...,Xn) = 9(%1,.--,Xn_1) — GnX?. By the first part, 
g(X1,...,Xn_1) represents every element of K* unless kK = R and g is positive definite. 
(Just consider g(X1,...,X4,0,...,0).) In the first case, g represents a, so f represents 0. 
In the second case, g represents all positive reals, and f fails to represents all reals iff a, is 
negative, i.e. f is positive definite. 


Corollary 29.2.11: A form f in n > 5 variables represents 0 in K unless there is a real 
place v with f positive definite in K,. 


Proof. This follows directly from Lemma.|29.2.10]and the Hasse-Minkowski Theorem |29.2.3 


3 Chebotarev density theorem 


Definition 29.3.1: The density of a set of primes S in K is d if 


_ pe S| dtp < N}| 
d= lim 
Nooo {p| Itp < N} 


The Dirichlet density of a set of primes S in K is 0 if 


Ses 
4 lan = . 
solt ln B-1 
(Note that pan aa ~ | 4 as s — 1* by a weak version of the prime number theorem for 


number fields.) 


Note if a set of primes has density d, then it has Dirichlet density d (an exercise in 
partial summation), but a set of primes having a Dirichlet density may not have a well- 
defined density. 


Theorem 29.3.2 (Chebotarev density theorem): ca: Let L/K be a finite Galois extension 
of number fields, and let C' be a conjugacy class G. The set of prime ideals p of K such that 
(p, L/K) =C has density tat: 

In the special case that G is abelian, the conjugacy classes are just elements and they 
occur with density a An especially notable case is the following. 
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Example 29.3.3 (Dirichlet): Let n € N and k be relatively prime to n. Then the set 
{q prime |qg=k (mod n)} 


has density oan 
Indeed, Chebotarev gives that the density of g where (q,L/K) is a specific element is 


aan: By Example|24/24.1.6| this gives that the density of q being a specific (relatively prime) 


residue modulo n is —. 

p(n) 
Example 29.3.4: If L/K is a Galois extension, then the density of primes of K splitting 
in L is Taag 


Indeed, a prime splits completely iff (p, L/A) = 1, by Proposition |24]24.1.3 


3.1 Proof 


We prove a weaker form of the Chebotarev Density Theorem, with Dirichlet density. We 
will need the following. 


Theorem 29.3.5 (Dirichlet’s theorem for number fields): timdiricntet-ne Let A be a number 
field, let H be a congruence subgroup modulo m, and let & be a class in J? /H. The set of 
prime ideals p of AK such that p € & has density aH" 

BE 


Proof. See Lang [Lan94\ VIII. §4] for the proof with Dirichlet density. O 
In the proof below, we use “density” to mean “Dirichlet density.” 


Proof of Chebotarev Density Theorem|29.3.4 We can’t deal with nonabelian extensions di- 
rectly, so the idea is to reduce to the abelian case as follows. Consider L/L’; this is cyclic. 
A prime in L with (8, L/K) = o descends to a prime 8’ such that (B’, L/L°) = o|ze. 
Since L/L” is abelian, these primes ‘8’ are characterzed by a modular condition, and we 
can find their density using Theorem [29.3.5] Then we will relate the density of primes with 
(p, L/K) = C to the density of primes with (8, L/K) =o. 


L She 
L? LaF Su 
K 


S= {p: (p, L/K) = C}. 
Note that fixing o € C, p € S iff there exists 8 | p in L such that (B,L/K) =o. 


Let 
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Suppose o € C has order f. Then L/L” is a cyclic extension of degree f. Let ¢ be the 
conductor of this extension. The Artin map gives an isomorphism 


[eef HH = G(L/R®) 


for some congruence subgroup H. 
Let S;,, be those primes in L whose Frobenius element is o: 


Sto = 1B: (PB, L/K) =o}. 


(Note that U,ec Sto gives all primes above those in S.) Let Sz be those primes in L’ := L? 
below a prime in L’: 
sp = {PN L7: Be hohe 

We have a bijection S,, = Sz, by $B + BML’, because o generates the decomposition 
group Dz /«(%), and L/L?+/*«) has no splitting. 

Now the density depends only on primes of degree 1 over Q. Since H is a subgroup of 
index f in I}., by ning Sy has density - 

Given p such that (p, L/K) = C, how many primes 8 above p satisfy (8, L/K) = o? 
Choose ‘Bo above p. The primes above p are 78 for 7 € G(L/K). Each prime is hit 
|Di/« (B)| = f times. Now we have (70, L/K) = a iff 


T(Po, L/K)r' =o. 


The number of such 7 is equal to the order of the stabilizer of the conjugation action (i.e. 
the number of elements commuting with 7) which is N divided by the number of elements 


in an orbit, i.e. far Hence the number of ‘8 lying above p with (8, L/K) =a is 


N/|C| _ N 
ij IOV 
The density of Sz, is +. Now every (ag good primes in LZ correspond to 1 good prime down 
below, so we get the desired density to be 


\/f IC 


N/(IC|f) NS 


3.2 Applications 


Often, we will need Chebotarev just for the existence of infinitely many primes with (p, L/K) = 
C, or just for the existence of a prime after we exclude a set of zero density. Here is a typical 
application. 


Corollary 29.3.6:  cor:chebotarev-resielat Let K be a number field. There exist infinitely many 
primes p of Q such that there is a prime p | p of kK with (p,L/K) =C and Mp = p. 
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Proof. Chebotarev’s Theorem says there is a positive Dirichlet density of primes p 
with (p,L/k) = C. The Dirichlet density of primes p with residue degree greater than 1 
is 0, because a sum of terms of the form of with f > 2 converges. Hence infinitely many 
primes must remain. 


Definition 29.3.7: For two sets S,T, we write S © T to mean S C TU A for some finite 
set A, ie. we have inclusion except for finitely many elements. We write S ~ T if SST 
and S&T. 


Definition 29.3.8: Define 


Spl(M/k) = {p prime of K splitting completely in M}. 
Spl(M/K) = {p prime of K unramified in M, f(98/p) = 1 for some in M}. 


If p is unramified in K and f(8/p) = 1, we say that $8 is a split factor of p. 


Note Spl(M/K) = Spl(M/K) if M/K is Galois. 
The following says that the primes that split in a Galois extension characterize the 
extension uniquely, as well as giving inclusions between extensions. 


Theorem 29.3.9: spiit-chebotarey Let L/K and M/K be finite field extensions. 

1. If L/K is Galois, then L C M iff Spl(M/K) © Spl(L/K). 

2. If M/K is Galois, then L C M iff Spl(M/K) ~ Spl(L/K). 

3. If L/K and M/K are Galois, then L = M if and only if Spl(M/K) = Spl(L/K). 
In (1) and (2), inclusions actually hold. 
Proof. 


1. Suppose L C M, and p € Spl(M/K). Say that PB | p and f(/p) = 1. Let P! = 
BOCK. Then f(’/p) = 1. Additionally, e(B8/p) = 1 implies e(’/p) = 1. Since 
L/K is Galois, the ramification indices and residue field degrees are equal for all primes 
above p. Hence Spl(M/K)C Spl(L/K). 

Conversely suppose Spl(M/K) A Spl(L/k). Let N/K bea Galois extension containing 
L and M. It suffices to show G(N/M) C G(N/L); then Galois theory gives M D L. 
Take any 0 € G(N/M). By Chebotarev Density there exist infinitely many 
primes p in Kt such that (p, N/K) = [o]. For such a prime p, let $B be a prime lying 
above p in N such that (8, N/K) = o and let 8’ = BO Gy. For such a prime we 
have 

a=a(a)=a” (mod ¥’), a€ Gy. 
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The left equality holds because o fixes M and the right equality holds by definition 
of (8, N/K). Hence Gy /' C Foy = Ox /p, and equality must hold. In other words, 
f ('/p) = 1. Hence p € Spl(M/K). Since Spl(M/K) < Spl(L/K), we can take p such 
that p € Spl(Z/K) as well. Then o|; = 1 hence G(N/M) C G(N/L) and M D L. 


2. Suppose L C M. Then any prime splitting completely in M splits completely in L, so 
Spl(M/K) C Spl(L/K). 
Conversely suppose Spl(M/K) © Spl(L/K). Let L®! be the Galois closure of L. Since 
M/K is Galois, Spl(M/K) = Spl(M/K); we also have Spl(L/K) = Spl(L8!/K) (Any 
prime splitting completely in L splits completely in the Galois closure, by exercise 2 in 


8). Thus 


Spl(M/K) C Spl(L#"/K) 
and we can apply part 1 to get L8! C M; a fortiori L C M. 


3. Apply part 2 twice. 


4 Splitting of primes 


sec:splitting 


4.1 Splitting of primes 
Theorem 29.4.1: tmspiittingot primes Let L/K be an extension of number fields. 


1. If L8"/K is abelian, then there is a modulus m and a congruence subgroup modulo m 
such that 


Spl(L/K) = {prime p € H}. 
2. If there exists R € Cx(m) = IR /Px(1,m) such that 
{prime p:p (mod Px(1,m)) = &} © Spl(L/K), 


(i.e. all but finitely many primes satisfying a certain modular condition split) then 
L8"'/K is abelian. 


In other words the law of decomposition of primes in an extension L/K is determined by 
modular conditions iff L/K is an abelian extension. 


Proof. 3 As Spl(L/K) = Spl(L8""/K), it suffices to consider L/K Galois. 


3This proof is from|http: //mathoverflow.net/questions/11688 
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Part 1: By global class field theory, the kernel as the Artin map I? — G(L/K) is a congruence 


subgroup H. But we have by Proposition 3| that p splits completely iff wz/«(p) = 
(p, L/K) =1. Hence 


H = ker(#z/x) = Spl(L/K). 
Part 2: Let Ky be the ray class field of K modulo m and M = LKy. There is a natural map 


P= D1 X po: G(M/K) > G(Km/K) x G(L/K) > Cx(m) x G(L/K) 


where the second map is given by PEK in the first component. 
For all but finitely many primes, we have the following string of facts. 


1 per. 
2. p € Spl(L/K). 
3. (p, L/K) =1. 
4. For any prime ¥ | p in M, p((8, M/K)) = (8, 1). 


) 
(1) == (2) is by assumption, (2) <> (3) is Proposition [24)24.1.3] and (3) <= (4) is 
by compatibility of the Frobenius elements (the map G(M/k) > > GL m/K) x G(L/K) is 
compatible with the map on residue fields G(m/k) > G(km/k) x G(I/k)). 

Suppose o € G(M/K) and p(c) = (R,g). By Chebotarev’s Theorem there exist primes 
3% | pin M and K, respectively, such that (8, M/K) =o. But (1) = (4) shows that 
g=1. Hence 

P(G(M/K)) 0 (8, G(L/B)) = {(&, D}- 


Since p is a group homomorphism that is surjective in the first component, p(G(M/Kk)) 9 
(R’,G(L/K)) must consist of 1 element for every &’, in particular for &’ = 1. Thus if 
p(o) = (Px(1,m),g), then g = 1. Given a prime p splitting completely in Ky, i.e. p such 
that p € Px(1,m), take any P| pin M. Then p(B, M/K) = (Px(1,m),g) for some g, so 
g=1and 

(p, L/K) = (8, M/K)|1 = po(B, M/K) = 9 = 1, 


i.e. p splits completely in L. Thus Spl(Lm/K) © Spl(L/K), showing by Theorem|2 that 
oe vem 


For nonabelian extensions, the set of primes that split has to be specified by more than 
just a modulo condition. 


Example 29.4.2: cxprime-split-nonab We show that a prime splits completely in Q(¢3, V2) iff 
p =1 (mod 8) and pis in the form x? + 27y”. 

Note that Q(¢3, V2) is the splitting field of x? — 2 = 0. ie an unramified prime, p splits 
completely iff the residue field extension has degree 1, i.e. x*® — 2 splits completely in F, 
This is true iff 2 i s a cubic residue modulo p. As we saw in Theorem [29.1.14] this is true iff 
p is of the form x? + 27y?. 
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4.2 Roots of polynomials over finite fields 


We can recast the problem of splitting behavior in terms of finding roots of univariate 
polynomials over finite fields. Let L/K be a finite extension, and f € @x|X] be the minimal 
polynomial of a primitive element in L/kK. Then Theorem |15[15.6.3] tells us that for a prime 
p relatively prime to the conductor of L/K, the factorization of f in Ox /p corresponds to 
the factorization of p. In particular, p splits completely iff f splits completely, and p has a 
split factor iff f has a root in Ox/p. 


Definition 29.4.3: Let N,(f) denote the number of zeros of f in Cx /p. 
Thus we can rephrase Theorem |29.4.1| as follows. 


Theorem 29.4.4: thm:roctsover Let f be an irreducible polynomial over kK. Let a be a root 
of f and L be the Galois closure of K(a). 


1. For all except a finite number of primes, N,(f) = m iff Yz/K(p) = [o] for some 
oa € G(L/K) fixes m of the roots of L. 


2. The sets {p : N,(f) =m} are given by modular conditions iff L/K is abelian. 


{cE€G(L/K):0 fixes m roots} 


3. The density of primes p such that N,(f) =m is EK] 


Proof. The first item follows from Theorem|15]15.6.3} The second item follows from this and 
Theorem |29.4.1} The third item follows from the Chebotarev Density Theorem |29.3.2 


Even the reciprocity laws (at least, weak reciprocity) can be put in the same framework: 
in a field K containing nth roots of unity, a is a perfect nth power modulo p iff x” — a 
splits completely modulo p (the polynomial viewpoint), i.e. the prime p splits completely in 
K(*/a)/K (the splitting viewpoint). 


5 Hilbert class field 


sec:hcf 


Definition 29.5.1: The Hilbert class field of K is the largest abelian field extension of 
K unramified over K at all places. (For infinite places this means that no real embedding 
becomes complex.) It is denoted Hx. 

The large Hilbert class field of K is the largest abelian field extension of K unramified 
over K at all finite places, with no restrictions for infinite places (i.e. they are allowed to 
ramify). It is denoted H}. 


Note if K is already totally complex then Hy = H}. 
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Proposition 29.5.2: p:-ncr The Hilbert class field and large Hilbert class field exist, and the 
global reciprocity map gives isomorphisms 


G(Hx/K) & Cl 
G(H~/K) = Cp. 


Proof. The Hilbert class field is exactly the ray class field corresponding to the modulus 1, 

and the narrow Hilbert class field is exactly the ray class field corresponding to the modulus 

m = [],, rea) U- Indeed, by global class field theory the fields corresponding to congruence 

subgroups of Cx(1) are just the fields unramified over K, and the fields corresponding to 

congruence subgroups of Cx (m) are just the fields unramified at every infinite place. 
The global reciprocity map gives the desired isomorphisms. 


The most interesting property of the Hilbert class field is the following. 


Theorem 29.5.3: Let K be a global field. Every fractional ideal of K becomes principal 
in the Hilbert class field L of K. 


Proof. Let M be the Hilbert class field of L. By Proposition [29.5.2| the global reciprocity 
map gives Cx — G(L/K) and C, > G(M/L). We will transfer the map Cx — C;, to the 
Galois groups. By definition, L is the maximal unramified abelian extension of K; since M 
is also unramified over kK, L is the maximal abelian subextension of M/k. But by Galois 
theory, intermediate Galois extensions correspond to quotient groups of G(M/K). This 
means that 
G(L/K) = G(M/I)/G(M/L) 

is the largest abelian quotient of G(L/K). From group theory this means that G(M/L) is 
the derived subgroup (G(L/K))’. 

The following diagram commutes by compatibility of the Artin map (the last diagram in 


Theorem together with Theorem |28]28.5.1) 


Ce “2S G(L/K)® 


|v 


OM/L G(M/L)*® 


ia 


CL 


where V is the transfer. 

However, the transfer map is 0 by Theorem |25}25.11.13] and the fact that G(M/L) = 
G(L/K)'. Hence the map Ck — Cy is trivial, i.e. every fractional ideal of kK becomes trivial 
in L. 


6 Primes represented by quadratic forms 


sec:primes-rep-q We now give a complete characterization of which primes can be represented by 
which binary (positive definite integral) quadratic forms. First consider the form x? + ny’. 
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A prime is in the form p = x? + ny? iff p splits as pp = (2 + y/n)(x — yn) in Z[,/—n], 
with its factors being principal ideals. We can think of this as saying that p goes to O in 
the ideal class group of Z[,/—n]. Unfortunately, this is not the same class group as Cx. 
However, this class group is essentially a quotient of a ray class group (Theorem [17]17.6.2). 
But by class field theory, we can find a field extension L such that the Artin map to G(L/K) 
is an isomorphism. The primes in the kernel of the Artin map are exactly those that split 
completely in L, so this relates the equation x? + ny? to the splitting of primes in the Hilbert 
class field. 


Definition 29.6.1: Let @ be an integral quadratic order and f := disc(@). 
1. Suppose f < 0. The field LZ corresponding to the congruence subgroup 
Px(Z, f) = {(a) €Ix(f): a (mod f)€Z (mod f)} C Ix(f) 
is called the ring class field of @. 
2. Suppose f > 0. The field L corresponding to the congruence subgroup 
Px(Z, oof) = {(@) €Ix(f):a (mod f)€Z (mod f)} C Ix (f) 
is called the ring class field of @. 


The reason for this definition is that Ix(f)/Px(Z, oof) = 1(@)/P*(@) = Ct(@) via the 
map a++ aM @, by Theorem /17/17.6.2| (Ignore the co when K is imaginary; in this case 
CT(@) = C(@).) 


Example 29.6.2: When @ = @x, with K/Q a quadratic extension, then the ring class field 
is just the large Hilbert class field of K, because [(@)/P*(@) = Cp. 


Theorem 29.6.3: thm:p=x2+ny2 Let n > 1. Let Q be a quadratic form that corresponds to 
a C R under the Gauss correspondence let K = Frac(R), and let p be an odd 
prime not dividing f := disc(R). Let 6 be the ideal corresponding to a under the map 
Tk(f)/Px(Z, oof) + I(@)/P*(@) = C*(@). Let L be the ring class field of R and suppose 
(L/K,6) =o. Then 

f represents p = > (L/Q,p) = [o| 


where [a] denotes the conjugacy class of o in G(L/Q). 

Proof. Let K = Q(.,/—n). We have the following string of equivalences. 
1. Q represents p. 
2. pR = pp in R for some ideal p in the same ideal class as a. 


3. pOx = pp for p ~ b where the ideals are considered in Ix (f)/Px(Z, cof). 
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A. pO ye = ph tor (L/ KD) =o, 
5. (L/Q,p) = [0]. 


The equivalence (1) <= (2) follows from Proposition [17]17.5.4] We have (2) <> (3) 
by Theorem which gives an isomorphism Ix (f)/P(Z,oof) > I(@)/P*(@) = 
C*(@) by sending ato aN@. By definition of ring class field, the Artin map is an isomorphism 
I()/Px(Z, cof) + G(L/K), so (3) <=> (A). 

For (4) <=> (5), note by definition of the Artin symbol that (4) is equivalent to 


p splits in Ox and o(a) =a'*! (mod 9) for all a € L 


where 8 is any prime dividing p in L. Since p is unramified, p splits in @x iff [k : F,] = 1, 
iff || = p. Hence the above is equivalent to 


o(a)=a? (mod 3%) 


This says exactly that (L/Q,p) = [o]. 


Corollary 29.6.4: Suppose n # 0 is an integer. 
1. Let L be the ring class field of Z|,/—n]. Then p can be represented as 
p=x’tny’, 2,yEeZ 


if and only if p splits completely in L. 


2. For —n = 1 (mod 4), let L’ be the ring class field of Z [=]. Then p can be 
represented as 
2 l-n 4 
pH=x+aryt+ a Y 
iff p splits completely in L’. 


Remark 29.6.5: It is not hard to show that we can replace the conditions by the following 
uniform statement: 4p can be represented as 4p = x? + dy? iff p splits completely in the 
order of discriminant —d. 


Proof. These quadratic forms correspond to the principal ideals in Z[,/—n] and Z eae 


respectively (Example]|17|17.5.3), so the theorem says p can be represented by the quadratic 
forms iff 


(L/K,p) =1. 
This is true iff p splits completely in L (Proposition |24]24.1.3). 


How is this useful? Algorithmically, there are fast ways to find solutions to p = x7 + ny? 
(Cornacchia’s algorithm), so we can obtain primes splitting completely in the Hilbert class 
field Hx. This means that the minimal polynomial of Hx/K factors completely modulo p. 
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As we will in Chapter [41] the roots are the j-invariants of CM elliptic curves; the fact that 
they are in F,, gives us an easy way to calculate the action of the class group on elliptic 
curves. 

Additionally, this description of solutions to p = x? + ny? gives a way to find the density 
of primes represented by a quadratic form. 


Theorem 29.6.6: tnmdensity-cform Let @ be a primitive positive definite quadratic form of 
discriminant D < 0, and let S be the set of primes represented by Q. Then the density of 
primes d(S) represented by S' is 


Q properly equivalent to its opposite, 


1 
(DY? else, 


= 
d(S) -_ ie 


where h(D) is the class number of the quadratic ring with discriminant D. In particular, Q 
represents infinitely many prime numbers. 


Note “Q properly equivalent to its opposite” is equivalent to saying that the ideal class 
corresponding to @ has order dividing 2. 


Example 29.6.7: h(—27) = 3 so § of all primes can be represented by the form «? + 27y’. 


In fact, the ring class field of Z[,/—27] is Q(¢3, 2), so p = 27+27y? iff p splits completely 
in Q(¢3, W2). This shows Example |29.4.2]in a different way. 


Proof of Theorem |29.6.6| Let K be the quadratic field of discriminant D. 

By Theorem [29.6.3] p is represented by Q iff (L/Q, p) = [a] where L is the ring class field 
of the order corresponding to @ and Q corresponds to ao under the Gauss correspondence. 
We need to find [co], so we first need to understand G(L/Q). 

Since C(@) = Ix(f)/Px(Z, f) = G(L/K) via the Artin map, 


[L: K] =|C(6)| =A(D) => [L: Q|=2h(D). 


Next we show G(L/Q) = G(L/K) =» G(K/Q) where, denoting complex conjugation by 
o € G(K/Q), we have oro! = 77! for allt € G(L/K). Let m be the modulus corresponding 
to f@x, where f is the conductor. By construction of L, it is the unique field such that 
ker(wz/K) = Px(Z, f). However, because the Artin map commutes with Galois action (see 


the third diagram in Theorem |27.4.10), 
ker(Wo(1)/k) = 0 ker(r/x) = oPR(Z, f) = Px(Z, f). 


Uniqueness hence gives o(L) = L, ie. o € L. Hence |G(L/Q)| = 2|G(L/K)| = [L: Q|, 
giving that L/Qis Galois. Givent € G(L/K), by surjectivity of the Frobenius map|28|28.2.8| 
T = (L/K,p) for some p. Then by Lemma |24]24.1.2] 


oro”! = o(L/K,p)o~! = (L/K, op) = (L/K,B) = (L/K,p)t = 7, 
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as needed. 

From the structure of G(L/Q), we see that the conjugacy class of any element o is 
{o,0-'}. By the Chebotarev density theorem the density of primes such that 
(L/Q, p) = [o] = {o,0~*} is hence 


as needed. 


7 Introduction to the Langlands program 


sec:intro-langlands In this section, we’ll give the big picture, and be content with morally, rather 
than mathematically correct, statements. 

Much of modern number theory is occupied with the relationship between the following 
three objects. 


1. Algebraic varieties, i.e. polynomial equations. 


2. Galois representations, i.e. continuous functions from G(K/K) to algebraic groups 
such as GL,,(C). 


3. Automorphic forms, i.e. continuous functions defined on an algebraic group on the 
ideles, such as GL, (Ax), and satisfying certain conditions. 


The relationship between Galois representations and automorphic forms is known as the 
Langlands correspondence. More precisely, there is a conjectural correspondence 


( cuspidal automorphic } (irreducible an 
} Pepresentations of GL, (Ax) )f “> ae (K/K) > GLr( c) | 
rt algebraic at oo J algebraic at f J 


We can define L-series from both Galois representations and automorphic forms. L-series 
from Galois representations arise more naturally in number theory (because it is relatively 
easy to go from algebraic varieties to Galois representations), but as automorphic forms are 
analytic objects, L-series of automorphic forms are known to satisfy more properties. The 
Langlands correspondence allows us to show that L-series of Galois representations arise 
from automorphic forms, hence have nice analytic properties as well. This allows us to prove 
various results about algebraic varieties, such as density theorems on the number of solutions 
over finite fields, for example the Sato-Tate conjecture. 

We first give some more precise definitions, then describe this relationship in the 1- 
dimensional abelian case (which we have in fact proved!), and then give an overview of how 
it generalizes. 
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7.1 Definitions 


Definition 29.7.1: Let k be a topological field (for instance, C or Q;), and let V = k” be 
a n-dimensional vector space over k. A n-dimensional Galois representation of K over k 
is a continuous homomorphism 


p: G(K*/K) > GL(V) = GLa(k). 


Let p be a prime of k. We say p is unramified at p if [,(4¢°/h) C ker(p). 

Let AK be a number field. Let Frob(p) be a Frobenius element of p in Ky, (defined in 
G(K,/K,) up to I(K,/K,)). Define the (modified) characteristic polynomial of p at p 
to be 

P,(X) := det(1 — X - p(Frob(p))|V14"/*)), 


(Here, V7 (K»/K») denotes the subspace of V fixed by the inertia group. P,(X) is well-defined 
because Frob(p) is defined up to I(Ay/K,), and I(Ky/Ky) C ker(p|,,1a%/x,))- In particular, 


if p is unramified at p, then V = Vi(Ko/Kp) ) 
We can now define the L-function associated to a Galois representation. 


Definition 29.7.2: In the above, suppose V is a complex vector space and K is a number 
field. The local L-factor at a prime p is 


L,(p, 8) = Pa) 
The Artin L-function of p is* 


L(p, 8) = II Zo(, 8). 
p 


We have the following conjecture. 
Conjecture 29.7.3 (Artin’s conjecture): Every Artin Z-function has analytic continuation 
to C and satisfies a functional equation. 


7.2 Class field theory is 1-dimensional Langlands 
For a different take on some of these ideas, with concrete examples, see Dalawat |Dal11}. 


7.2.1 Galois representations are automorphic representations 


We rephrase global class field theory in the form that generalizes under the Langlands pro- 
gram. 


4Sometimes infinite places are included. The factors at infinite places take more thought to define so we 
exclude them here. 
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Theorem 29.7.4 (Rephrase of GCFT): tnm:repnrase-gete There is a bijection between continuous 
homomorphisms x : Aft./K*(KX)° > C* and continuous homomorphisms p : G(K/K) > 
GL,(C), given by the following. 
{x : Ax /K*(KX)° + C*} © {0 : G(K/K) > GLi(C)} 
XH xOdbK 
Proof. From Theorem|24]24.6.5| the Artin map gives a topological isomorphism Aj /K*(KX)° > 


G(k*>/k). It remains to note that any function G(K/K) — GL,(C) factors through 
G(K/K)*> = G(K®>/K), since GL1(C) is abelian. 


The functions on the left side have a special name. 


Definition 29.7.5: A Hecke character is a continuous homomorphism Aj-/K*(K%)° > 
C*, or equivalently, a homomorphism 


x: Ce 3S':={reEC: |z|=1} 


with finite image. The conductor of y is the smallest modulus m such that y factors 
through A; /K*Ux(1, m) = Cx(m). 


The homomorphisms y : Ay. /K*(KX)° > C are “automorphic functions” on GLi(Ax), 
a.k.a. Hecke characters, and the homomorphisms p : G(A/K) — GL (C) are 1-dimensional 
“Galois representations.” Our correspondence is unsatisfactory, however, because we would 
like to get all continuous homomorphisms Aj; /A™ — C*, not just those factoring through 
Ax /K* (KX). Since G(k??/K) has the profinite topology, any continuous homomorphism 
G(K/K) > GLi(C) must have finite image, while functions Af/K* — C* can have infinite 
image. To remedy this, we introduce functions G(K/K) — GL,(C) with infinite image (no 
longer continuous under the complex topology). 

For simplicity, we just consider the case of Q. 


Example 29.7.6: We say a function 7 : Ag/Q* — C is algebraic at oo if 7(ig(x)) = 
sign(x)'|a|" for some m € {0,1} and n € Z. We characterize all the continuous homomor- 
phisms 7 : Ay. /K* — C* (“Gréssencharacters” ) that are algebraic at oo. 

It is enough to introduce 1 more character. Let ¢ be a prime of Q. Let |-| : Ag/Q* > C* 
denote the map [x] = [Tyevg |Zv|v, and define x2 by 


xe: G(Q/Q) —* G(Q*/Q) = G(Q(6.)/Q) ++ Z* =], ZX —» GLi(Z,). 


(We say y¢ is “algebraic at @.” Note there is a noncanonical field isomorphism Q; = C, so 
we can think of GL;(Z,) as being “inside” GL;(C).) 

Every continuous homomorphism 7 : A; /K* — C% algebraic at oo is in the form 
|-|"-., where y is a Hecke character. We can extend the correspondence in Theorem |29.7.4] 
by associating |-| with yz: 

t= |-|"-x XE + (K0 OR) 


where the right-hand side is now viewed in Q, instead of C. 
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7.2.2 Artin [-functions are Hecke L-functions 


Associated to each Hecke character is a L-function. 


Definition 29.7.7: Let x be a Hecke character and m be the conductor of x. The L- 
function associated to y is 


1 
HO09) = Wyte 


Because x admits a modulus, Hecke L-series have nice analytic properties. 


Theorem 29.7.8 (Hecke, Tate): tim:-anatytic-con. Every Hecke L-series admits an analytic con- 
tinuation to C and satisfies a functional equation. 


For the details, see Tate’s thesis in [CF69]. 


Theorem 29.7.9: Any 1-dimensional Artin Z-function is a Hecke Z-function. Hence it has 
analytic continuation and satisfies a functional equation. 


Proof. Let p: G(k/K) — GL,(C) be a 1-dimensional representation. By Theorem |29.7.4| 
p(®,) = x(p) for some Hecke character y : Ax /K* — C*. Let m be the modulus of p; note 
it is also the conductor for y. Then 


1 1 


1 = p(,) 9p t I 1— x(p)Stp-s 


L(p, s) = Il 


prm 


= L(x, 8). 


This theorem is another way of saying that the Artin map factors through a modulus, 
and this is basically what allowed us to get all the density results in this chapter. 


7.2.3 Algebraic varieties and Galois representations 
We give examples of how to get Galois representations from algebraic varieties. 
First consider the variety Q* = {x EQ:x4F O}. It is a group under multiplication, 


and the torsion points Q™[m] are exactly the roots of unity fm. We can define a Galois 
representation by considering the action of G(Q/Q) on the /-power roots of unity. Define 
the Tate module of Q” by 


T(Q*) = lim Q” [C"] = Yim pron = Ze. 


n 


Then G(Q/Q) acts naturally on T,(Q” ) so we get a representation 
p: G(Q/Q) > Aut(Tr(Q*)) & Aut(Ze) 9 GL (Q,) 
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sending the element ¢g(p) to p. The corresponding L-function is just a translate of the ¢ 
function, missing the factor ¢: [],z = This construction is a good analogy for what 
we will eventually do with elliptic curves, although it is a bit too “trivial” to capture any 
significant number theory facts. 

We give another example, with equations in 1 variable, which is a bit less natural but 
show more of the number theory. Consider the variety defined by f(X) = 0 where f € K[X] 
is a irreducible polynomial. Let a be a root, and L be the Galois closure of AK (a) over 
Kk. Let ay,...,Q be the roots of f in L. G(K/K) acts by permuting the a;, so we get 
a representation G(K/K) — S,. We can embed S;,, in some general linear group, to get 
p: G(K/K) > GL,,,(k) for some k. Then to find how many roots f has modulo p, we can 
look at the trace of p(Frob(p)). 

For example, consider f(X) = X?—X—1 over Q. We get a representation p: G(K/K) > 
S'3 — GL2(C), where we embed $3 <+ GLo(C) as follows: we have a natural permutation 
representation S3 — GL3(C); now take out the trivial representation to get S3 ~ GL2(C). 
From this description we have N,(f) = tr(p(Frob(p))) + 1, so we can get the number of 
solutions of X3 — X — 1 =0 (mod p) from looking at the trace of Frobenius. Constructing 
the L-function, the trace of Frobenius becomes the coefficient of 4+. Now p comes from 
an automorphic form, so L comes from a 2-dimensional automorphic form, i.e. a modular 
form. We can write this modular form explicitly using theta functions or as an eta quotient. 
At the end of the day, we have this striking fact: For p # 23, the number of solutions of 
X?— X —1=0 (mod p) is N,(f) = a» +1, where ay, is the coefficient of the modular form 


J 1 x? x 2 xv x 2 = nm 
qtr =~ St ag ee ) = yaaa 
(eyez? n=l 


> 
ll 
fan 


(See Serre’s article [Ser03].) In this example we have traced out a relationship 


(algebraic variety)—(Galois representation) (automorphic form). 


7.3 Elliptic curves and 2-dimensional Langlands 


7.3.1 Galois representations and automorphic representations 


Definition 29.7.10: A 2-dimensional automorphic form is a continuous function GL2(Q)\ GL2(Ag) 
satisfying certain conditions. 
A large class of 2-dimensional automorphic forms can be related to modular forms. A 


holomorphic function f(z) : # — C is a modular function of weight k for a congruence 
subgroup [ C GL2(Z) if 


f(yz) = (ez +. d)¥ f (z) for all y = (: 1) ar. 


*x x 


0 ‘) (mod nyt, we say f is of level N. Here H 


denotes the upper half-plane {z : S(z) > 0} and yz = 


If Pf =To(N) := {M é€s$L(Z):M= ( 


az+b 
cz+d° 
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A modular function is a modular form if it is holomorphic at cusps of H* = HUP!(Q). 
A cusp form is a modular form that vanishes at the cusps. 


There is a way to go from modular forms to Galois representations; this is better under- 
stood than going in the opposite direction. One of the biggest theorems in the 2-D case is 
Serre’s conjecture, now a theorem, that tells us that we can go from Galois representations 
to modular forms in certain cases. 


Definition 29.7.11: We say a Galois representation is modular if there exists a cusp form 
f of some level N and a finite set S' such that 


f= Dang", tr(p(Frob(p))) = ay for p ¢ 5. 


Theorem 29.7.12 (Serre’s conjecture; Khare, Wintenberger): Any irreducible odd Galois 


representation p : G(Q/Q) — GL2(F,) is modular. 


7.3.2 Elliptic curves and Galois representations 


Given an elliptic curve, we can define a Galois representation by looking at its torsion points. 


Definition 29.7.13: Let EF be an elliptic curve over a number field Kk. It is known that the 
m-torsion points E[m] over K satisfy 


E|m] = Z/m x Z/m. 


(See Silverman |Sil86} ITI.6.4].) 
Define the ¢-adic Tate module of FE by 


TE = jim BE" 7 


As G(K/K) acts on E[é"] for each n, it acts on T;E, so we get a map 
G(K/K) > Aut TE = GLo(Zp) 4 GLo(Qy), 


called the ¢-adic Galois representation of E.° 


Thus we can define the L-series of an elliptic curve, by defining it as the L-series of the 
corresponding Galois representation. (Roughly speaking, this definition is independent of 
the choice of ¢.) We’ll flesh out this definition in Section Thus we have the (tentative) 


correspondences 


eq:ec-lang( Elliptic curves) --» (Galois representations) --+ (cusp forms) (29.10) 


eq:ec-lang2(L-series of elliptic curve) --+ (Z-series of modular form). (29.11) 


Alternatively, let VeE := TyE @ Q and consider G(K/K) as acting on VE. 
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Again, more is known about L-series of modular forms since modular forms have nice ana- 
lytic properties and transformation properties. The theory of Jacquet-Langlands establishes 
analytic continuation and functional equations for L-series coming from modular forms. 


This relationships in (29.10) and (29.11) are involved in the proof of two big theorems. 
1. We now know the dotted lines in (29.10) are true, thanks to the following. 


Theorem 29.7.14 (Modularity Theorem; Taniyama-Shimura-Weil): All elliptic curves 
are modular. 


The heart of this proof is in showing that the Galois representations associated to the 
elliptic curves come from modular forms. This theorem (or rather, its earlier version 
with semistable elliptic curves) is what allowed the proof of Fermat’s last theorem: 
there is no nontrivial solution to a” + b" = c” for n > 2. A nontrivial solution would 
give rise to an elliptic curve associated to a modular form that does not exist. 


2. By working with L-functions of the elliptic curves, and reinterpreting them as L- 
functions of certain automorphic forms as in (29.11), one can prove the following. 


Theorem 29.7.15 (Sato-Tate conjecture; Barnet-Lamb, Geraghty, Harris, Taylor): 
Let EF be an elliptic curve without complex multiplication, and let E(F,) denote the 
set of solutions to EF over F,. The density of primes p with |E(F,)| € [p+1+a,/p,p+ 
1+6,/p), for -l1<a<b<1lis 


d({p : |E(B,)| € [p+ 1+ aVB,p+ 1+ b/}}) = © [ vin wate. 


By the correspondence between elliptic curves and modular forms, another way to 
phrase this theorem is that the distribution of coefficients of certain modular forms is 
the same “semicircle” distribution. 


This theorem is like the elliptic curve analogue of the Dirichlet’s theorem on the dis- 
tribution of primes in congruence classes. 


8 Problems 


3.1 (from Serre, [Ser03]) Using Chebotarev’s Density Theorem, prove the following. 


Theorem: Let f € Z[X] be an irreducible polynomial of degree n > 2. Let N,(f) 
denote the number of zeros of f in F,. Then the set P(f) of primes with N,(f) = 0 
has a density co(f). Moreover, co(f) > +, with strict inequality if n is not a prime 
power. 


You may use the following theorem from group theory. 


418 


Number Theory, §29.8 


Theorem (Jordan): Let G is a group acting transitively on a finite set S with n > 2 
elements. There exists g € G having no fixed point in S. If n is not a prime power, 
then there exist at least 2 such g. 


3.2 (All primes divide some coefficient of A) Let @ be a given prime, and Ky be the maximal 
extension of Q ramified only at @. Given that there is a continuous homomorphism 
(a.k.a. Galois representation) 


pe: G(Ke/Q) + GLo(Fe) 


such that 
tr(e(Frobx,/q(p))) = T(P) 


for all p 4 @, and that there is an element in im(/,) with trace 0, prove that a positive 
proportion of primes p have the property that 


| 7(p). 


Note. Here 7 is Ramanujan’s tau function, the coefficients of a certain modular form 
A. For more on the relationship between Galois representations and congruences for 
coefficients of modular forms, see Birch and Swinnerton-Dyer |SD72}. 


4.1 In Section [4] we showed that L/K is abelian iff the primes that split can be character- 
ized by a modular condition. In this problem, we do more: given a Galois extension 
L/K, characterize the maximal abelian subextension by looking at the primes that 
split. 


(a) Let m be a modulus for AK, and suppose L/K is a Galois extension. Let Hm be 
the subset of the ray class field Cx(m) defined as follows: 


Hy = {: There exists p € RK such that p splits completely in L}. 
Show that Hy» is a subgroup of Cx(m). 


(b) Suppose we are given the groups Hy» for all m. Characterize the maximal abelian 
subextension of L/K. 


6.1 Prove an analogue of Theorem |29.6.6]for positive discriminants. 


6.2 Let n > 0 be an integer such that K = Q(./—n) is an imaginary quadratic field, and 
let 

(2 2 _ 

e+ ny", n=1 (mod 4 

Q(z,y) = | 


a+ary+5ty?, n=3 (mod 4). 


(a) Find a condition on G(Hx/K) so that for all but a finite number of primes, the 
primes represented by Q are given by a modulo condition. In other words, find 
all n such that there exists m and a set of residues S modulo m such that if p { m, 
then p is represented by Q iff p is congruent to a residue in m. (Hint: combine 
the results of Section [4] with Section (6) 
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(b) Find some values of n for which | Cl(/’)| ¢ 1 and such that the primes represented 
by Q are given by a modulo condition. 


(c) Suppose G(Hx/K) satisfies the property you found in part 1. Characterize all 
n, not necessarily prime, such that Q represents n. (For simplicity, you can just 
consider n | m.) Compare to the statement in Example |17/17.3.4}) 


1. (Genus theory) It is useful to group the equivalence classes of quadratic forms with 
given discriminant into genera (plural of genus). 


Definition 29.8.1: Define a similarity relation between primitive quadratic forms of 
discriminant d as follows. We say Q; ~ Q2 if Q; and Q2 represent the same values in 
(Z/dZ)*. The similarity classes are called genera. 


In this problem you will find an easy way to characterize the genera of discriminant d. 


(a) Let H be the subgroup of C(d) such that C(d)/H = C(d)[2] where G[n] denotes 
the n-torsion subgroup of G. 
Prove that Qi,Q2 € C(d) are in the same genera iff Q,,Q»2 are the same in 
C(d)/H. 
In particular, conclude that the number of genera is a power of 2. 


(b) Let M be the ring class field of AK and let L denote the subextension of M/k 
such that G(L/K) = C(d)[2]. (That is, under the Galois correspondence, L C M 
corresponds to H C C(d) = G(M/K).) Prove that L/Q is the maximal abelian 
subextension of M/Q. 


The fact that L/K is abelian, while M@/K may not be, makes it much easier to prove 
results pertaining to a genus of quadratic forms rather than an equivalence class of 
quadratic forms. 


2. (2?) Let f,g € Q[X] be two irreducible cubic polynomials. How can you determine 
algorithmically whether f,g have roots a, 8 such that Q(a) = Q(6)? (from {http : //| 


math.stackexchange.com/questions/34522/cubic-polynomials-that;-generate 
rq=1) 
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Chapter 30 


Elementary estimates for primes 


1 Chebyshev’s Theorem 


Today we prove some asymptotic results about the distribution of prime numbers. Specifi- 
cally, we derive estimates for the prime-counting functions 


D(x) = ¥/ In(p) 


(2) = FS In(p) 
Te). py 1 


Note that we will always use p to denote a prime. 

Lacking the tools of complex analysis, it is difficult to find the exact asymptotic formulas; 
however, our elementary methods suffice to determine the asymptotics up to a constant 
multiple. Our main result is Chebyshev’s Theorem: 


Theorem 30.1.1: [?, Theorem 6.3] There exist positive constants c; and cy such that 


cx < V(x) < w(x) < r(x) n(x) < coe. (30.1) 
for all x > 2. Moreover, 
vires Op eenen OE) eg NE (30.2) 
L—->CO xe XL 0O r xL—->0O x 
l 
lim sup uy) = lim sup pat) = lim sup oer) < 21n(2) (30.3) 
~—>00 Hi xr 0O XL Loo XL 


We will prove this in three steps. 
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1.1 Comparing the three functions 


Since all terms in the sum defining U(x) are included in the sum defining w(x), U(x) < w(x). 


For a given p there are | ™%)| choices for k so that p* < x, so 
In(p) 


In(x 
ve) =F ne) =O |B] mip) < Y mle) = a(0) n(n). 
pe <a psx In(p) psu 
This shows the middle two inequalities in (30.1). 
Given 0(x) < v(x) < (x) n(x), to show that the three quantities in (30.2) and (30.3) 
are equal it suffices to show that 
m(a) In(a) 


0 0 
lim inf va) > lim inf —————, lim sup ae) > lim sup 
LOO i b L700 x £00 & x00 


] 
n(x) In(z) (30.4) 
x 
To compare U(x) = ,<, In(p) and m(x) In(x) = X1,<, n(x), note that for p “close” to x, we 
have In(p) “close” to In(z) and relatively large, while the terms for small p will not contribute 
much to either sum. Thus we can just consider the terms with p > x'~°, where 6 € (0,1). 


Bx)> >) In(p) 


xi-$§ <p<a 


> SY In(c’®) 


= In(e!~®)(m(2) — a(a!-9)) 


= (1 — 6) n(x) (m(x) — x(a") 
> (1 — 6) In(x)(a(x) — 2) 
Hence 
v(x) = (1—46)(x) n(x) _ (1 — 4) In(z) 
a x xo 


Letting 5 > 0 gives (30.4). 


1.2 Upper Bound 


We show that V(x) < 2x 1n(zx). Instead of thinking about bounding (x), it is easier to think 
about bounding e”) = Pee 


Lemma 30.1.2: 3.?] For any x EN, 


lea" (30.5) 
pSu 
Proof. Use strong induction on x. For « = 1,2 the statement holds. The induction step 
from odd x > 1 to x + 1 is obvious, since x + 1 is not a prime. 
Consider the induction step from even x to x+1. Let x = 2n. The key idea is that there 
cannot be “too many” primes between n + 2 and 2n + 1, because... 
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1. These primes all divide beta = _ ie 


2. uae can easily be bounded from above: 
2n+ 1 at 2n+ 1 1 2n+ 1 ress Aone _ yn 
n 2 n we 1 yar a 


Te=- lle OU psa (™*1) cm 


pSa+l psntl n+2<p<2n+1 


Then 


Taking the logarithm of both sides of (80.5) gives 0(a) < (x — 1) In(4) < 2a In(z). 


1.3. Lower Bound 


We show that lim inf,_,., m(a)in(@) > In(2). First consider when x is even, say equal to 2n. 
Like in Section 3, we consider a binomial coefficient, this time ae We show that each 
prime cannot appear as a factor in oe “too many” times, so it can be bounded above by 
(2n)"?"). We can easily bound (-") below: 


2n gem 

> 

nj} ~ 2n 

2n 


ae Putting these two bounds together will give 


since it is the largest among 2, ees fans ( 
the desired bound for 7(2n). 
We need the following to count the highest prime powers dividing eee 


Lemma 30.1.3: [AD08! Lemma 6.3] For every positive integer n, 


mB) | 
> Lal 


where v,(m) denotes the largest integer i such that p'|m. 


L 
O(n!) = 


Proof. There are || multiples of p* less than or equal to n. In the sum ars ||, each 
k 


multiple of p* less than n is counted k times, once each as a multiple of p,p?,...,p 


From Lemma }30.1.3} we get 


+ (2) =» (22) =nteo0y-ann— 5 [22] -2[2 
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Since each term of the sum is at most 1, 
2n 2 oo S In(2n) 
An] ] > Ung) J * Tn)” 
2n oi 
2 < (?") = II p(n) < (Qnjre™, 


Taking logs and remembering z = 2n gives x1n(2) — In(x) < a(x) In(x), which gives the 

desired bound. For odd z, the value of x(2) inte) can be compared to the value for x — 1. 
Finally, (30.2) and (30.3), and the fact that all the prime-counting functions are positive 

for x > 2, show the existence of c; and c2 in (30.1). This finishes the proof of Theorem|30.1.1] 


Thus 


1.4 The nth prime 


We found an estimate for the number of primes less than or equal to a given number; we 
can use this bound to find an estimate for the nth prime number. 


Theorem 30.1.4: Let p, denote the nth prime number. Then there exist constants c3, c4 
such that 
c3nIn(n) < pn < can ln(n) 


for all n > 2. 


Proof. From Theorem |80.1.1| 


SB sna By me 
SO 
nIn(pn) Sy 2 nIn(pn) 
C2 Cy 
The LHS is at least c3n In(n) by the trivial bound n < p,. On the RHS, use the LHS of 


nIn(pn) 
cy 


again to get In(p,) < In ( i giving In(p,,) < cln(n) for some c. 
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Crash course in complex analysis 


complex-analysis Complex analysis is calculus on the complex numbers. The main functions of 
study are complex differentiable functions. 
Reference books: Lang or Ahlfors 


1 Holomorphic functions 


Definition 31.1.1: Let U C C be an open set and f : U — C bea function. The derivative 
of f is 


f'(2) = lim 


if it exists. f is holomorphic if its derivative exists at every point of U. f is meromorphic 
if it is defined and holomorphic on U except at a discrete set of points. 


Write f(x + iy) = u(x, y) + iv(x,y). Note that f being differentiable is a much stronger 
condition than being simply u and v being differentiable, because the limit of f as Az > 0 
along the real and complex directions must be equal: 


Ou Ov a! (5 rr =) 
dy | dy)” 


Ox ar Ox i 
Thus we get the Cauchy-Riemann criteria: If f is differentiable as a function of (x,y), then 
f is holomorphic iff 


Ox Oy’ Oy Ox 
Another way to think about complex differentiability is that holomorphic maps preserve 
angles (i.e. are conformal); we have 


f(ztre™®) — f(z) & re” f'(z). 


Because complex differentiability is such a strong property, holomorphic functions have 
many nice properties. Hence it is often useful to take functions defined on the reals and 
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extend them as far as possible on C. Some of the good properties are the following (to be 
explained in the rest of the chapter); note they are not necessarily true for real differentiable 
functions! 


e A function is holomorphic iff it is analytic (has a power series expansion). 


e A sequence of holomorphic functions with good convergence properties converges to a 
holomorphic function. 


e A bounded entire function is constant. 


e If two holomorphic functions agree on a set containing a limit point, then they are 
equal. Thus analytic continuations are unique. 


e Bounds on a function give bounds on the derivative. Hence we can “differentiate” 
asymptotic formulas. 


e We can expand holomorphic functions into products or sums depending on their poles 
and zeros—in much the same way that rational functions can be expanded into partial 
fractions or factored. 


2 Complex integration 
We now give two definitions of the integral. 


Definition 31.2.1: A path is a continuous function y : [a,b] — C. It is called a loop if 
y(a) = 7(b). Let f be a holomorphic function on U and ¥y be a path in U. 


1. If 7 is differentiable (except possibly at a finite number of points), define 
b 
[i@ae=f fooy'oa. 


2. Define an (indefinite) integral of f on a set V to be a function F’ on V such that 
F'(z) = f(z). Given holomorphic f, choose points to,...,t, such that there exist open 
sets U; D f(7((t;-1,t,])) such that f has an integral F; on U;. Define 


[oe SiO) = Bylylé-a)) 


Note that unlike in the real case, indefinite integrals may not exist globally, for example, 
Int is locally an integral for + but cannot be extended holomorphically to C\{0}. We need 
to establish the well-definedness of the second definition. 
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Theorem 31.2.2 (Cauchy’s Theorem, version 1): caucny: Let f be holomorphic on a closed 
rectangle R, with boundary OR. Then (using the first definition), 


7 =: 
OR 


From this one can show that integrals exist locally by defining 


where the integral is along horizontal and vertical lines; moreover one gets well-definedness 
in the second definition. 
We can now define the logarithm of a function. 


Definition 31.2.3: Let f be a holomorphic function on a simply connected set U (see 
Definition |31.3.1), with f(z) 4 0 on U. Choose zp € U and ao such that e% = z. 


mnia= [4 


Note different definitions of the logarithm will differ by integer multiples of 2772, and 
e(™)@) — f(z). The motivation comes from the fact that one would expect the derivative 
of In f(z) to be E (2). We write (In f)(z) to emphasize that this is not simply a composite 
of functions: We could have f(z1) = f(z2) but (In f)(21) ¥ (In f)(z2)." 

We seek a generalization of Theorem to meromorphic functions and arbitrary 
paths. 


(z) dz. 


3 Cauchy’s Theorem 


Definition 31.3.1: romotopic Two paths y and 7: [a,b] > C are homotopic if there exists a 
continuous map 
7ys(t) : [0,1] x [a,b] 3 C 
such that yo(t) = y(t) and y(t) = 7(¢). 
A subset of C is simply connected if it is pathwise connected and every loop in C is 
homotopic to a point. 


Theorem 31.3.2: Let U bea simply connected open set containing zg. Every path y around 
z in U\{z} is homotopic to a circle going around zp n times for some n € Z. This n can 
be calculated by 


ne WZ) = ee aes dz 


271 y2— 2 


and is called the winding number. 


‘Consider, for example, the case where f(z) = 2 on C\R<o, and 2 =i, z2 = —i. 
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Theorem 31.3.3 (Global Cauchy’s formula): Let U be a simply connected open set and 
f :U —>C be holomorphic. Suppose ¥y is a loop in U. Then 
1 z 
a ene) dz = W(7, 20) f (20): 


271 Jy 2 — 2% 


4 Power series and Laurent series 


As complex differentiability is a much stronger condition than differentiability for real func- 
tions, holomorphic functions enjoy nicer properties. The most important one is the following. 


Definition 31.4.1: A function f : U > C is analytic at 2 if it can be written as a power 
series in a neighborhood around 2p: 


(oe) 


f(z) = Do an(z — 20)”. 


n=0 


= NR) 


n! 


If f is given by its power series representation then we must have a, 
Theorem 31.4.2: A function f : U > C is analytic iff and only iff it is holomorphic. 


Note this is not true for real functions: for example, e- = has Taylor expansion equal to 0 
at 0, but is not the zero function. This kind of irregularity does not happen for holomorphic 
functions. 


Corollary 31.4.3: A holomorphic function has infinitely many derivatives. 


The following theorem says that for holomorphic functions, the radius of convergence is 
“as large as it could possibly be.” 


Theorem 31.4.4: radius-convergence Suppose f is holomorphic on a disc N,(zo) of radius r around 
zy. Then the Taylor series around zo converges absolutely to f on N,(z0). 


Proof. Estimate coefficients using Cauchy’s theorem. Complex Analysis, Lang III.7.3. 


We can generalize power series to allow terms with negative exponents. 


Theorem 31.4.5: Suppose f is defined on an annulus A = {z:r < |z— z| < R}. Let C 
be the circle of radius r’ € (r, R) around zp. Then f has a Laurent expansion on A: 


Fl) = So az— 2)", =f , i) 


= ie 
‘ n+1 
pane 277 z 20) 


If f is defined on {z: |z — 2| < R} then 
| 
(n) _ f(z) 
f(z) je dz 


2nt zZ—z)rth 
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The coefficient a_, is called the residue of f at Zo: 


Res, (f) = a1. 


The following theorem controls the size of the derivatives of a complex analytic function by 
its values of the function in a circle. Note that in the real analytic case we can’t make such 
a statement! 


Corollary 31.4.6: corcauchy-ineq Suppose f is defined on {z:|z— 29| < R}, and let C be a 
circle of radius r < R around zp. Then 


(2) < ™ max f(2) 


and the nth coefficient in the power series expansion satisfies 


Proof. Simply note that in the integral fo ain dz, the denominator has constant absolute 


20) 
n+l the numerator is bounded by max,ec f(z), and the arc length is 2zr. 


value r 


Corollary 31.4.7 (Liouville): A bounded entire function is constant. 


Proof. We can take r — oo in the inequality for n = 1 to find that f’(z) = 0 everywhere. 


4.1 Cauchy’s residue formula 


Using residues, we can state the most comprehensive form of Cauchy’s formula: 


Theorem 31.4.8 (Residue formula): :esiaue Suppose f is meromorphic on simply connected 
open U, and ¥ is a loop in U. Then 


[ fle)ds = 2m S> W(7,z) Res.(f). 


z pole of f 


One useful application of this is counting zeros and poles of a function f. 


Definition 31.4.9: Define the order of f at zp to be the least integer so that the Laurent 
expansion of f at 2 has a, 4 0: 
ord (zo) = m. 


Note that ord s(zo) > 0 signals a zero and ord (zo) < 0 signals a pole. 


Corollary 31.4.10: Suppose f is meromorphic on simply connected open U, and ¥ is a 
loop in U. Then 


ra [FO = DWC. pond). 
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Proof. If f has Laurent expansion a,,(z — 2)" +--+ at zp then f has Laurent expansion 


Mm (Zz — 2) > +++: 
am(e = 0)" Fo 


=m(z— 2%) +--- 


5 Convergence 


Unlike in the real case, holomorphic functions behave nicely under infinite sums and pointwise 
convergence. This is because by Cauchy’s theorem we can write f as an integral, and integrals 
preserve convergence. 


Theorem 31.5.1 (Holomorphic functions converge to holomorphic functions): Let {f,}°°, 
be a sequence of holomorphic functions on U. 


1. Suppose f, — f uniformly on compact subsets of U. Then f is holomorphic. 


2. Suppose >>°°., fn = f converges absolutely and uniformly on compact subsets of U. 
Then f is holomorphic. 


6 Series and product developments 


We know that locally, we can write a meromorphic function f as a Laurent series 77°. nx”. 
There are two other representations that are useful, depending on what information we have 


about the function f. 


1. If we know the poles of f, we can write f as a sum of rational functions 


) = Qala] + 912. 
2. If f is entire and we know the zeros of f, we can write f as an infinite product 


f(z) = 20 T] (1-2) eC). 


n=1 en 


(Think of this as “factoring” f, much like a polynomial can be factored as in the 
fundamental theorem of algebra.) These representations come about from convergence 
properties of holomorphic functions—so we can be sure the infinite products converge 
to holomorphic functions—and by Liouville’s theorem—if we engineer a function that 
is close enough to f then it must be equal to f. 
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Theorem 31.6.1 (Mittag-Leffler): Let z, be a sequence with lim, ... |Zn| = co (or a finite 
sequence), and P,, polynomials without constant term. 


1. (Existence) There is a meromorphic function f with poles exactly at z,, with Laurent 
expansion P,, ( -) +--+ at Zp. 


2. (Uniqueness) All such functions f are in the form 


> (P, (+) - an) +02 


n=1 a — an 


where Q,, is a polynomial and g(z) is analytic. 


Proof. See Ahlfors [AhI79| p. 187]. 


Warning: this does not converge for all P,. Typically we take Q, to the the first terms of 
the Laurent expansion of P,, (ah to ensure cancellation of high-order terms. 


Definition 31.6.2: The order of an entire function f is the smallest a € [0,00] such that 
IF(2)| Se eh 
for alle > 0. 


Theorem 31.6.3: product-development Let 2, be a sequence with limy.. |Zn| = oo. If f is entire 
with order a < oo with zeros 2, 22,... (with multiplicity, not including 0), then it has a 
product formula 


sd Bo z\2 \ \ BV 
product-formula | (Z = 2% A) Il (1 — =) €2n AC a mee) 5 (31.1) 


n=1 
where 

2 m= la), 

e r is the order of vanishing of f at 0, and 

e g is a polynomial of degree at most a. 
The product converges uniformly locally. Moreover, 
ae ae ed ec als (31,2) 
Conversely, if a = |a| and zx is a sequence satisfying (31.2), then the RHS of defines 


an entire function of order at most a. 


num-zeros 


Proof. See Ahlfors [AhI79| p. 195}. 


Hence the order of a entire function gives an asymptotic bound for the number of zeros.” 


2 function which grows faster is allowed to have more zeros—much like a polynomial with lots of zeros 
grows fast simply because it has higher degree. 
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7 Gamma function 


To prove basic properties of the zeta function in the next chapter, we need to know the 
properties of the gamma function. 


Definition 31.7.1: Define the gamma function by 
oo d 
(a) = i. ne” and Rs > 0. 
0 x 


We will begin by analytically continuing the gamma function and giving its basic prop- 
erties. 


Proposition 31.7.2 (Facts about [): gamma-tacts 


1. ['(s) can be analytically continued to a meromorphic function with poles —n,n € N, 
(—1)” 
ni * 


with residue 


8. py = se TI, (1+ 8) 8 


4. I(s+1) =sI(s) soT(n+1) =nl,n € No. 


6. - yP P (s+ nm) “al (s + m—1) = (20) "2 m2-™T (ms). In particular, [(s)P (s + i) = 


From the product development |31.6.3] we get the following. 


Theorem 31.7.3 (Product development of FP): gamma-product-development We have 


In the region 
R, = C\({s: arg(s) € [tm —e,7 +e]} U {0}), 


i.e. C with a wedge containing R<o deleted, we can define the function (InT)(s). By the 
product formula, it equals 


(nT)(s) =—ys —Ins+ 3 (5 —In (1+ =)) ; 


k=1 


The following asymptotic formulas will be useful. 
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Theorem 31.7.4 (Stirling’s approximation): stining Let P(t) = {t} — 5. For s € R., 


amty(s) = (s— 3)ins—~ s+ Sngan) — [* 2O 


1 
= (s — 5) Ins—s+ 5 In(27) + O.(|s|~*) 


1 —2 
Fay = ins gg + alls) 


I(s) ~ s°-2¢678\/2n 
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Chapter 32 


Dirichlet series 


diricnlet For proofs see |Apo94]. 


1 Dirichlet series, convergence 


Dirichlet series are the “power series of number theory.” As such, we will first need to get 
acquainted with their analytic properties. 


Definition 32.1.1: A Dirichlet series is a series of the form 


F(s) = 3 in) 


n 


where f(n) is an arithmetical function. Following convention, we let s = o + it, with o,t 
real. 
Let {A(n)} be a sequence strictly increasing to oo. A general Dirichlet series with 


exponents {A(n)}°°, is in the form 


Fig) = 3 fine, 


n=1 


An ordinary Dirichlet series has A(n) = In(n). ! 


Theorem 32.1.2 (Half-plane of convergence): Convergence: If the series 3°, | f(n)e~*™ | 
does not converge or diverge for all n, then there exists a real number o,, called the abscissa 
of convergence, such that 0>°., f(n)n~* 


e converges locally uniformly for o > o,, but 


e does not converge for 0 < d¢. 


‘A further generalization is given by the Laplace-Stieltjes transform, i e*'da(t), where a is a measure. 
The “step” part of a gives a Dirichlet while the continuous part gives a Laplace transform. 
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In fact, if the series diverges for all s with o < 0, then 


In |Sop_1 a(k 
oc = lim sup oe Pata os Mf 


Absolute convergence: If the series 7°, |e~**”)| does not converge or diverge for 
all n, then there exists a real number o,, called the abscissa of absolute convergence, 
such that (°°, f(n)n-* 

e converges locally uniformly absolutely for 0 > 0,4, but 


e does not converge absolutely for a < aq. 


In fact, if the series diverges for all s with o < 0, then 


ln? k 
a, = lim sup ne 


In particular, for ordinary Dirichlet series (that diverge when o < 0), 


0, = limsup rear lA), 
Noo 


2 Basic properties 
Proposition 32.2.1 (General facts): Let F(s) = 072, f(n)n-°. 
1. limg +o F(o + it) = f(1) uniformly 


2. (Uniqueness) If F'(s) = G(s) are absolutely convergent for 0 > o, and are equal for s 
in an infinite sequence {s;,} with a, — oo, then f(n) = g(n). 


3. (Non-vanishing in half-plane) Suppose F's) 4 0 for some s with o > o,. Then there 
is a half-place ¢ > c > o, in which F’(s) is never 0. 


Proposition 32.2.2: (Operations on Dirichlet series) operon-air Let F'(s) = 0°, f(n)n~* and 
G(s) = 4 g(n)n =. Then 


PGs) = 


where 


h(n) = (f+ 9)(n) = 3 Fag (=). 
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Proof. Formally, by grouping together terms where mn is constant, 


d(m) a(n) 


ne ns 


F(s)G(s) = 


joa = 


II 

iM: 
——_ 
3 

any 

Zz 

3 

3 

ll 

ca 


Since the sums for Ff’ and G converge absolutely, so does the double sum above, and the 
rearrangement of terms is valid. 


Theorem 32.2.3 (Euler products): cuter-proauct Let f be a multiplicative arithmetical function 
such that >°°., f(m) converges absolutely. Then when Rs > oa, 


Y flan" = TI (1 , fe) , fle) | ). 


If f is completely multiplicative, 


a1 


= f(m) _ 
Xe ior 


Proposition 32.2.4 (Derivatives): air-derivative The derivative is 
--~ n)inn 


Theorem 32.2.5 (Landau): ianaax Suppose f(s) is a holomorphic function that can be rep- 
resented in 0 > c by the Dirichlet series 


2 ny f(njn- 


with f(n) > 0 for all n > no. If F(s) is analytic in some disc of radius r around s = c, then 
F'(s) converges in 0 > o — € for some € > 0. 
Hence, F'(s) has a singularity at s = o-. 


Proof. We reinterpret in terms of power series and apply Theorem |31.4.4 
Take a = c+ &. Since F is analytic at in N,(a) C N,(c) U{z: Rz > c}, it equals its 
Taylor expansion there: 
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From Proposition |32.2.4, F(a) = (—1)* %°%, f(n)(Inn)*n-s. Plugging in and noting that 
the sum converges absolutely (since f(n > 0 for large n), we have, for s € N,(a), 


= S- f(nje®) Inn, —@ 


This converges for c—e € N,(a). But because it has nonnegative real coefficients, this shows 
CF L=s. 


Proposition 32.2.6 (Logarithms): Assume f(1) 4 0. if F(s) 4 0 for o > 09 > oa, then for 


O > 00, 
fap), 


Inn 


ts 
In F(s) =In f(1) + > Je 
Also talk about log diff of Euler product 


3 Dirichlet generating functions 


Definition 32.3.1: Let f : N — C be an arithmetic function. The Dirichlet generating 
function of f is 


To get the generating function of g(n) = Van f(m), by Proposition |32.2.2) we simply 
multiply by ¢(s): 


Hence by matching coefficients of 


we get the Mobius inversion formula. 
Table of dgf’s here 


440 


Number Theory, $32.4 


4 Summing coefficients 


Lemma 32.4.1: pir-menin For Y,C, i 02 


1 a , as Sue ( 1 ;) ee 
-— — min | ————, — 
Oni Jer 9 5 | 74 nT|Iny|’? 2)’ ! 
1 set'T ds 1 Sig 
—s , sonatas I = 
2nt i as | aT” 
1 petiT ds 1 
— *— — 1) < y° min | ————,, 1 St 
201 - s ee min (ay : : 


Proof. First suppose y < 1. Take d > c. By Cauchy’s theorem, since _ is analytic in the 


region below, we have 
cHiT ds d+iT ds d-iT ds ciT ds 
(es oes i 
c-iT S cHiT Ss diT 8 diT ” 8 


where the path of integrations are those shown in the picture. 


Hence, 


d+iT ds e-iT ds d-iT ds 
eg ee 
cHiT Ss d—iT Ss dil Ss 
d ido d-iT ds 

<2 i. Yat | y— 
c TD d+iT Ss 
Note that the last integral goes to 0 as d > oo, because |y*| = |y“| > 0. Hence, taking 


cHiT <ds 
aes 


: z 1 ct+iT , sds 
This gives |55 four we> 


cHiT ds 
s 
c-iT Ss 


d — oo gives 
love) y” 2y° Sy 


=a ~ Tiny T}Iny| 


< a Iny|. 


°The integral 54, [°*S°" f(s) is called the Mellin transform of f. 


44] 


Number Theory, $32.4 


By Cauchy’s theorem applied to the smaller segment bounded by kts = c and the circle 
with radius R = Vc? + T?, (picture) we have 


To ,aS| i , as 
nae G co’ s 
Cc 
< Ro = yy", 
since y < 1 and ¥ts > c on the arc. Hence |= cane SBI < fo 


For y > 1, take d < 0. Note © is analytic in the region below except for a simple pole at 
0 with residue 1 (since y* = 1 when s = 0). Hence by Cauchy’s Theorem, 


ctHiT ds d+iT ds d—iT ds c—iT 3 ds . 
[ore re 0 fo = 
c—-iT cHT S d+iT Ss d—iT S 


[INSERT PICCY] 


Then 
ctiT d 
[eve 


dil ,as c-iT , as d—iT , ds 
hae pa ae 
c+iT 8 d—iT 8 d+iT 8 
d—iT : ds 
(ove| 

dil S 


The last term goes to 0 as d + —oo, so the same argument applies as in the first part to 


lo oretiT , sds ys 
show Ee cir Ys -if< — aT iny’ 


By Cauchy’s theorem applied to the larger segment bounded by Rs = c and the circle 
with radius R = Vc? + T?, (picture) we have 


cHiT ds < ds 
| ore +/ y —=2m 
c-iT 8 Cc” Ss 
[ret] 
c-iT S C S 


Cc 


y 
< 2nRX = 2ny’, 
< 2nRF = 2ry 


Cc 


since y > 1 and ¥#s < c on the arc. Hence =u. 


c+iT 1s a 
= i —iT y 


Proof for y = 1 omitted. 


Corollary 32.4.2: sum-coc#pir The partial sum of the coefficients of a Dirichlet series is given 
by 


> dn + =(a € Ng) = = lim x f(s)— 


n<x Tt Too Jc-iT S 
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The error from truncating the integral is 


Ay N 1 ctHiT 2 
2 HF Ge SNe) - (Sef, ee 


443 


Number Theory, 832.4 


444 


Chapter 33 


Zeta functions and the prime number 
theorem 


zeta-l-pnt 


1 Prime number theorem: Outline 
Definition 33.1.1: Define the prime-counting function 
w(x) = |{p < x: p prime} |. 


Our goal in this chapter is to prove the following famous theorem (in all its error-bounded 
glory). 


Theorem 33.1.2 (Prime number theorem): pn There is an effective constant C' > 0 such 
that 
w(x) = li(w) + O(ae-CV™*) 


for all x > 1. 
Here li(x) denotes the logarithmic integral 
x dt 
li(e) = fo 
i) 2 Int 


Note that li(z) = {+O (a2) as © — oo, since integration by parts gives 


~~ Ine Ina)? 


li(z) = - a LO) = — i [ a +0(1) 


Lf L 
— ia a) (aa) .li-ibp (33.1) 
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1.1 The big picture 
We recommend Andrew Granville’s article [V.2 Analytic Number Theory in for an 


overview. 

How might we guess at the asymptotics for 7()? (In particular, why is it closer to li(x) 
than ;*.?) By studying tables of primes up to 3 million, Gauss hypothesized that the density 
of primes at around 2 is around ret and hence that the number of primes up to x would 
be the integral li(z) = f;’ “4. Making a table of 1(x) and the difference li(x) — m(x), we 
find that the difference is slightly more than on the order of \/z, so this seems to be a good 
estimate. 

It is a common theme in analytic number theory to make conjectures about the distri- 
bution of primes (or other subsets of interest) by assuming they are randomly distributed 
according to some probability model. Often a simple model works for simple asymptotics up 
to x, and the model needs to be refined or corrected when dealing with more complicated 
quantities such as number of primes in a small interval, or spacing between primes. 


Model 33.1.3 (Gauss-Cramér model): For n > 3, let X,, be the random variable such that 


1 
X,, = 1 with probability — 
Inn 
1 
X,, = 0 with probability 1 — —. 
Inn 


Then the sequence X, behaves similarly to the sequence 


a, = 1 if n is prime 


Gn, = 0 otherwise. 
The Gauss-Cramér model exactly predicts 7(x) ~ li(a). The model gives more than just 


the asymptotics of 7(a), though, it can also be used to think about primes in short intervals 
m(a+y)—1(2). 


Problem 33.1.4: What are the shortcomings of the Gauss-Cramér model? 


1.2 Main steps 


The main steps in the proof are as follows. 


1. When we have a Dirichlet series 


F(a)= So onan”, 
n=0 
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we can get estimates for ae dy, by “plucking out” those coefficients: The equation 


1 cHiT ds fn if Yy ol 
— hi anil = 1 7 — 
Sree ae 13) ify 1 
(9, iy < 1 
gives 
1 : cil ee 
a ee ao an + “(4 € No). 


We use the more precise statement giving error bounds (Corollary |32]32.4.2). 


We want a Dirichlet series where the sum of the first N terms is related to 7(V). Let 


c(s)= TI le 


—p prime 


We consider the function 


We use this function because ~(x) := °,,-, A(n) gives information on m(x), and —§ 
continues into a meromorphic function on C (since ¢ does). We now have the estimate 


1 c+iT t d 
ue) = oa 4 a + (error). 


2. We know ¢ has analytic continuation (Theorem |33.2.2). Hence we can move the path 
of integration to c < 0. From Cauchy’s integral formula, we get extra terms from 
the horizontal integrals (integrals involving =) and terms = from Cauchy’s integral 
theorem from the zeros of ¢(s). This is why we care about its zeros! Zeros with large 
real part contribute large error terms. We will need the following. 


(a) We apply the product development (Theorem |31]31.6.3) on €(s) = 77 2¢(s)T (3) 
to obtain C's) ; ; 
8 
: aves 
¢(s) Ps p 
(Theorem 33.2.5). 


(b) Using the above equation for $ 5, we calculate the asymptotics of N(7’), the number 


of zeros in {o + it : (o,t) € [0,1] x [-T, T]} (Theorem |33.3.2). 


(c) From (a) to (b) we get a zero-free region for ¢ (which includes #s > 1) (Theo- 


rems and |33.3.3). 
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From the zero-free region we get a bound for o as well as the horizontal integrals. 
If the Riemann hypothesis is true, then we can enlarge our zero-free region to ks > 5, 
which is even better. 


3. Finally we use the estimate for ¢(x) to get an estimate for 7(a) (Lemma|33.4.2). 


2 Riemann zeta function 
Definition 33.2.1: The Riemann zeta function is defined by 
nai 
when kts > 1. This will be generalized to L-functions L(s, x) in Definition [34)34.2.1| 
By Theorem [32.2.3] and by unique factorization in Z, we can write 
(= TT 


— n-s 
p prime 1 Pp 


By taking the logarithmic derivative, we have 


-F3 =D gad 9) = Deen 2 = Dope 


p p Dp 
Interchanging order of summation gives 
/ oe) 
log-diff-zeta — ¢ (s) => > A(n)n~*, Rs > 1, (33.2) 
¢(s) n=1 


where the von Mangoldt function A(n) is defined as 


oe ta n =p", p prime, r EN. 


0, else 


The most important property of ¢ is its analytic continuation and functional equation. 


Theorem 33.2.2: zctacontinues ¢($) can be analytically continued to a meromorphic function 
with a simple pole at s = 0,1. It satisfies the functional equation 


C(s) = 2(2m)SP(1 — s) sin (=) ¢(1—s). 
Letting €(s) = 7~2¢(s)P ($), we have! 
E(s) = €(1—s). 


Moreover, ¢(s) has zeros —2N (the trivial zeros); all other zeros are in the critical strip 
0<Rs <1. 


'The factor P (3) can be thought of as coming from the infinite place—see Chapter[36] 
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To prove this, we first need the transformation law for the theta function; we will show 
the functional equation for ¢ by writing it in terms of @. As we will prove a more generalized 
transformation law, we will postpone the proof for 6. 


Definition 33.2.3: Define the theta function by 
(a=, em Ru > 0. 


neZ 


Proposition 33.2.4 (Transformation law for @): tnetataw For all u with Ru > 0, 


This is a special case of Proposition |34]34.2.4 


Proof of Theorem We first analytically continue ¢ to Rs > 0, show the functional 
equation is true for 0 < Rs < 1, and use it to establish analytic continuation to C. 
Note 


+1 n+l 


1 -—s = 1 = —S_ ps 
ze de| = ax gee (n-°—a-*) dx (33.3) 


_ n 
continue-reta-to-0G (3) = + > [n= a | 
s—l n=l n 


Since forn <x2<n+1 we have 


x 
In — a *| = / eg da| < |s|n-2? 
n 


n+l 
| ni—ax2 fdr 


n 


< |s|n-*"*, (33.4) 


bound-zeta-summands 


the sum (33.3) converges uniformly locally for #ts > 0 and extends ¢ to an analytic function 
for Rs > 0. 
We claim that 


shemale) = i *(6(u) — ue, Rs >1 (33.5) 
Indeed, we have 
[Ow = Yul a = faye "a 
= ay f° ent ays a 
=2y) Per(S)S or) 
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The theta transformation law |33.2.4| give that for Rs > 1, 


2e(s) = f° (0(u) — ut + [(0(u) 1) 


= (a(-) - ae, (alu) — ui ee 
= (eZ) - 1)u' BO [wt w+ *(ou) — yu 


u 1 U 1 
2 


= 2 Fg Pou) — ry + PPO) - ype. 


The last expression converges for all 3s > 0, so in fact equals 2¢(s) for all Rs > 0 by 
uniqueness of analytic continuation. Since the last expression is symmetric under 1— s+ s, 
the functional equation for € follows. 

The functional equation for € gives 


a a oll =) 
P (3) 
1- in (8 
= 7°37 ( 5 *) i (4 - =) sin (Fa — s) by Proposition |31]31.7.2(5) 
T 
= 2(27)*" sin (=) T(1—s)¢(1—s) by Proposition |31]31.7.2(6) 


ey the statement about zeros follows from the fact that ¢ has no zeros with Rs > 1 
(as $ = is holomorphic there) and the functional equation, noting sin (=) =0 exactly when s 
is an even integer, with the zero at s = 0 cancelled by the pole at 1 of ¢. 


Theorem 33.2.5 (Product development of €):  x-proanet-aevelopment The function (s? — s)E(s) is 
entire of order 1, and €(s) has the product expansion 


A+Bs 8 P 
oo (1 = ) ae 
8? aaa p H P 


Then =(s) has the partial-fraction expansion 


C(s) =B-— + 5mm) - 55 (5 41); bs ( ae 


p nontrivial zero of ¢ a p 


From now on, unless otherwise specified, when we say zero of ¢ we mean nontrivial zero. 


Proof. Note (s? — s)€(s) is entire because € only has 2 simple poles at 0,1. To show it has 
order 1 we need two inequalities. 
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Step 1: There is no constant C’ so that (s? — s)€(s) x e@l’!: Indeed, for real s and any 
constant C’, by Stirling’s approximation |31/31.7.4) we have 


(s° — s)€(s) = (s* — sa (5) ¢(s) 


moh (8) rom. 
oe 2er 


Step 2: There is a constant C so that (s? — s)€(s) < eCls!lsl, elsimls| > 1 for all s so it 
suffices to prove this for sufficiently large s. By the integral and sum formulas for I and €, 
and the fact that |x*| = |x**|, we have 


on 


eo + ti)| <a FT (E 


) ¢(0), o> 1, 


C|s| In 


By symmetry of € is suffices to consider 0 > }. (“Nudging” |s| in e Is| by a constant 


changes it by at most a constant factor.) Consider 2 cases. 


1. o > 2: Then 1? < 1 and ¢(c) < ¢(2) so by Stirling’s approximation |31)31.7.4| 
IE(o + ti)| ST (=) — pllary(o)| — p($-1)Img-3+000) 


from which the result follows. 


2. 5 <o <2: From (33.4), we have for s bounded away from 1, 
es 3 
C(s) < O(1) + |s| }5 n7-2 = O(|s}). 
n=1 
This time [ (£) = O(1) so 


I(s° — s)€(s)| < 


s°n2C(s)T (5) _ O(\s\°) = eClslinist_ 


This shows (s? — s)€(s) has order 1. 


Step 3: By the product development 31]31.6.3} noting the the zeros of (s? — s)€ are the 
nontrivial zeros of ¢ (since I has no zeros and trivial zeros of ¢ come from the poles of [in 


the definition of €), we get 


(s? — s)&(s) =e**** TJ (1 = *) ep. 


p zero of ¢ 


Dividing by s? — s and log-differentiating gives 


eo. 4 1 4 
8s 
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c! i LI’ 7s i; @ i 1) 

—(s)=—-] PBS == 

c (8) ta) s s-—1l 2 =e 
i} 1I/s 1 1 1 P(z+1p) 
= —] ~—(—~+1)4+ B-— —— — LZi= = 
jnnt+5— (G+ )+ reps ere (z) : 


3 Zeros of zeta 


Note that from the function equation, ¢(s) has simple zeros at —2N. We call these trivial 
zeros. More importantly for us are the zeros with real part in [0, 1]. 

Denote by N(T) be the number of zeros of ¢ in {0 + it : (o,t) € [0,1] x [-T, T]}, count- 
ing multiplicity. We first give asymptotics on the vertical distribution of zeros of ¢ (von 
Mangoldt’s formula, Theorem 33.3.2), then give a zero-free region for ¢ (Theorem [33.3.3). 


Lemma 33.3.1: weat-zeta-zeros Define £(t) = In(|t| + 2). For s = 0 + it with o © [—1,2], we 
have? 


J i} 1 1 
ranatenamen oy =-4+5 (. | ) - O(L) (33.6) 
1 1 


=i D 


IS(s—p)i<1 ® ~ P 


+ O(L). 


Moreover, there are O(L) zeros p with |S(s—p)| < 1, i.e. the number of zeros with imaginary 
part in [é,¢ + 1] is O(Int), as t > oo. 


Note this gives N(T) = O(TInT). The next theorem will give an improvement of this 
estimate. 


Proof. Our strategy is this: at a point where we know © is bounded (s = 2+it), we use 
Theorem |33.2.5}to get information on how many zeros of ¢ can be close to s. Then we use 
compare *(¢ + it) with (2 + it) to get the general estimate. 


¢ 
Step 1: Theorem |33.2.5] gives us 
‘ 1 i] dt 1 1 
ciao (s) = — -+- B ++ lna7— (= ++ 1) +) ( -+ ) . (33.7) 
¢(s) s—l 2 2 1X2 7 err: P 
_———— _ 
O(1) (A) (B) 
From Stirling’s approximation |31]31.7.4| (A) equals 
if 
gamma2-estimate In is + 1 + is| + O(1) = O(L) (33.8) 


2Note — = O(1) when s is bounded away from 1. 
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These two equations show (33.6). 
Now suppose s = 2+ 7t. Note that 


C(2+it)) f= ig a 
C24) | [2 Mmm 
so the LHS of (33.7) is O(1). Hence (33.7) becomes 


1 1 
sotadcorosuma)(L) = S- ( + ) . (33.9) 


7 S=P Pp 


< 


We estimate the terms with |S(s—p)| < 1 by a constant to show that there aren’t too many 


of them. From (33.9) and (33.8), 
1 1 
o() = 8D (s- ' ) 


; pp 
a) ; (-) 
>R ( since R{ —]}] > 0 
2 ORFF F Spp : 
1 
= 1 < < 
2 apo since0< Rp <1 


1 1 
savom > s]{p 18(8—p)<1}l+5 YO (33.10) 


5 ig¢s—pisi E — Sp)? 


This proves the second part of the lemma. 


Step 2: Now we consider general s = o + it, by comparing it to 2 + it. We have by (33.7) 


and (33.8) that 


s s ; 
=-(s) — (2 + it) 
¢ ¢ 
O(1) 
1 l/. jo t 1 1 
a ot (10 oe -In|2 |) + ( —— ) 
Po laa aah Be oe by s—p 2+it—p 
ed 
01) 
1 1 1 (2—c) 
a Sag 8 =p a (s — p)(2+ it —p)’ 
IS(s—p)|<1 %°~ P — [B(s—p)I<1 P  |3(s—p)|>1 p ie 


O(L) O(L) 


The first O(L) is because there are at most O(£) terms and each term is at most 1 in absolute 
value; the second is from 


2=0 1 
ar a Sear ) = 01) 
os (s — p)(2 + it — p) (=. S(s — + 
the first equality is from 2—o0 = O(1) and S(s—p) = S(2+it—p); the second is by (33.10). 
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Theorem 33.3.2 (von Mangoldt): ceta-reros(*) As T’ > 00, 


N(T) = 


ig T cj 
In ( ) ——+O(nT). 
27 T 
Proof. As © has only a countable number of zeros, we may assume TJ’ is not the imaginary 
part of any zero. 
Let 


R= {e+ ts (2,4) € |=1,2| & 27 ]} 


and let C be the boundary of R. (PICTURE) From €(s) = 7~2¢(s)I (4), we see that € has 
the same zeros as ¢ in this region, and simple poles at 0 and 1. Hence by Cauchy’s residue 


formula |31]31.4.8| 


&'(s) _ 
mn “ihe wy = 2N(T) — 2. 


Noting that €(s) = €(s) and €(s) = €(1 — s), changes of variable show that the integral on 
each of the sections of C' between 2, 5 +77, —1, and 5 — iT are the same.? Let C’ be the 
part from 1 to 5 +iT. Thus the above equals 


&(s) 4, 2 ve a +f Cs) oe as (ian fx)’ -yf 


milo &(s) wa Teas Se k=1 


s (expression is real). 


We break this up into 3 integrals and estimate each part separately. 
1. S for —Btds =—FInz. 


2. Using the estimate for © in Lemma\|33.3.1| we evaluate the second integral. Note that 
In¢ is defined for Rs > 1 and is uniformly bounded for Rs = 2: 


(In ¢)(s =» In(1 — 
p prime 
|(In¢)(2 + it)| < So 2p. 
p prime 


(Just bound In linearly near 1, or expand in Taylor series.) Note In(a—p) is well-defined 


3We used € because its symmetry allows us to do this. 
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on C’ for any p. Hence by Theorem |33.3. 1} 


3 C8) ds = (S(In¢)(2 + iT) — S(In¢)(2)) + £+iT eC 


ae (s) ds 


2+iT ¢ 


3+iT 1 
— Ce 
=o f"3(_S ) + 00m) a 


s—p)|<i °~ P 


=O(nT)+ YO B(nle= per 


IS(s—p)|<1 


< O(InT) + 270(InT) 
since there are at most In 7’ terms in the sum. 


3. We estimate the last integral using Stirling’s formula |31]31.7.4| (Note that InT is 
well-defined for s € C’.) 


Now put everything together to get 


N(T)-2==(-Finr + 0(m7) + (Fn (5) - 5 +000) 


N(T) =m (=) 7 7 +O(InT). 


Theorem 33.3.3 (Zero-free region for €): zeta-zero-ree There are no zeros of ¢ with Rs > 1. 
Moreover, there is a constant c > 0 such that for |t| > 2, every zero o + it satisfies 


Cc 


<1—-— —. 
° In |t| 


PICTURE! 


Proof. We already noted ¢ has no zero for Rs > 1 (Theorem |33.2.2), so for the first part it 
suffices to prove that no zero has real part 1. 
If ¢ had a zero 1+7t, then ra would have a pole of positive residue at 1+7t. For s = a0 +it, 


ao > 1 we have —§(s) = Vr Ae) so this means that as 0 > 1+, many of the important 


—it « 


terms would have n close” to —1, to make it blow up in the negative direction. For those 
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terms, we have n~?” “close” to 1. This would force —£(o + 2ti) to have a pole of positive 
residue at 1 + 2t2, i.e ¢ to have a pole at 1 + 2ti, contradicting the fact that it is analytic 
there. 

We now make this idea precise. What we want is an inequality between some function of 
an angle and its double, so that if one is small it forces the other to be large. So we consider 


0 < 2(1+ cos@)* = 3 + 4c0s0 + cos 26. 
This gives . . 
0<3+4R(n-“) + Rin). 
Multiplying by A(n)n~? and summing, we get 


NOM ee (-<()) +49 (-St6 ‘i ti) +R (-t " 2) | o> 1. (33.11) 


Letting r be the degree of the zero at 1+ tz, we have by Lemma/}33.3.1 


3 4 
0< (— i o(1)) = (= “+ o()) +R (-Sc + 2) eo 
If r > 1, then this gives -$(6 + 2ti) + oo as o > 1, contradiction. Hence r = 0; 1+ it is 
not a zero. 

For the second statement, we have to use the partial fraction decomposition [83.2.5 


Suppose p = (1 — 6) + it is a zero. By Lemma )33.3.1} we have 


_ G(s) _ ee eee ee 
say = Ole ~ (<= +5) S Ole — 
Then 
© (o +ti) < O(n t}) - > — 
¢ 7 g+o=1 
-#&( + 2ti) < O(In|2¢|) = O(In|tl). 


For o > 1, plugging this into (33.11) gives 


3 
< —— l — 
=< soy ft Ola Ht) 


gro—1 
4 3 
= < Ci In |t 
o+6-1 ra er) 
for some C,. Now take o = 1+ 40 to get 
4 3 
—< —4+C,In|t 
po ap 
giving 
CS ! 
20C; In |t| 
as needed. 
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4 Prime number theorem: proof 


Now we gather everything together to prove the prime number theorem. We first show the 
following. 


Theorem 33.4.1 (von Mangoldt’s formula): von-Mangoiat-formula For an integer x > 2 anda > T, 


miah=o > 26 (a) (33.12) 


Is(pi<T P 


Proof. Step 1: We estimate v(x) using Theorem 32}32.4.2 Suppose x is an integer; the 
theorem gives 


pe - (fe (-£@S))]/<am+ E (ames 


iT S ine T {In (£) 
x\¢ In(n) 
<1 + —) ————. 
n(x) wey Aa) T |In (2) 
Take 

1 
cS 1+ 
Ing 


Note that this makes x° = ex = O(x). To estimate the sum we split it into several parts. 


1. 1<n< #?: We have 


1<n<é l<n<a 
x(Inz)? 
T 
2. <n < ez: We have 
S- =) Inn i > 1 Inz Inn 
= <n<ex, n£#x tt - lin (2) é<n<ex, n£x ih lin (2) 
1 ] 
a. — using nz ~x—-—1whenre1 
T eo an<ex, n#x a 
x zlnx > 1 
~ OD é<n<exnfAr In ~ x| 
zing i 
SF > . 
1<n<(e-1)z 
x(Inz)? 
T 
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3. n > ex: We have 


yy (=) sal < | = d my decreasing for y > e 
wen IY T T Jex—1 ye Cc 
a =a ing a 
L om 1 7 (c -_ 1 ex—1 
x(Inx)? 
Ae 


Putting everything together gives 


va) — (["e (-S) 2)]=0 (“E+ ma). (33.13) 


von-M-1 
iF ¢ S$ 


Step 2: We move the line of integration to #s = —1. Assuming that T is not the imaginary 
part of any root, by Cauchy’s residue theorem |31.4.8] PICTURE 


ere ar arte ae Ge Si? gC cir gC ¢' xP 
—=(s)dst fo 2 (s)ds+ fo De els) ds = =(0)- — 
few EO ee COM Lm COM eg COE= FO DE 
a, el 

In Ly Th,2 


Here - are the resuides at the zeros, —x comes from the pole of ¢ at 1, and <(0) comes 


from the pole of , Then 


cHiT x8 Cc ba 
wonaee ff a (-S) ds =.= 1 + Tha + Tho + I, — S> =r (33.14) 
c-iT § C Sper P 


We estimate each summand. 


1. For the horizontal integrals, we use the estimate |33.3.1]to get 


+O(0nT), s=o4+Ti 


IS(s—p)|<1 ® — P 
1 


< — 
ae—plcr SIS — P) 


+O(nT). 


We would like to bound S(s — p) away from 0. To do this, note that there are O(InT) 
roots in with Sp € [T,T +1] by Lemma|33.3.1| Hence by tweaking T slightly*, we can 


4Changing T by a constant does not change the error term of (33.12); moreover the change in the LHS 
; 2 
sum is O(#InT) = 0 (242°), 
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assume |S(s — p)| > ;& for all p. Also by Lemma |33.3.1] there are at most O(InT) 
terms in the sum, so the sum is O((In T)). Integrating gives 


i. Ti 8 wc, \ds 


+T% 8 


~ O((mT)2)O (=) - la*| ds 


2. For the vertical integral, we use the same estimate, this time noting that |s — p| > 1 
for every root p, since every zero satisfies Rp > 0. This gives that ©(s) = O(InT), and 


ie ee s) ds 


147% 


-14Ti y—1 
= O(InT) / Eady 


-1-Ti |s| 


nT 1 
= O | —— dt 
( x ) -T Vt? +1 
T+1 
o(=) f Sai 
x 1 t 


-o( (2) -0 (242) 


Equations (33.13) and (33.14) together with the above two estimates give the theorem. 


The final ingredient in the proof of the Prime Number Theorem is the estimate for 
Liswl<T - using the zero-free regions for ¢ and the estimate for number of zeros of ¢. 


Proof of Theorem First, note there can only be a finite number of zeros of ¢ with 
IS(p)| < 2, 80 Viaipyj<e . = O(2") for some fixed r < 1.° We estimate Dy<ig¢pj<r ~ in two 
steps. 


1. By Theorem |33.3.3} there is c such that for p with 2 < |S(p)| < T, 


elnz 


a? | = = iP < gh nr =7Te mT, 


In fact, there are zero such zeros. 


459 


Number Theory, $33.4 


2. Using N(T) = O(T InT) (Theorem |33.3.2|or the weaker remark after Lemma |33.3.1), 


1 seiiyer 50) 


a<isicr ll ~ s<isqier SP) 
Td ant) ; tof 
< (Riemann-Steltjes integral) 
2 
= ry" N(2) N(t) . 
ge +f 2 dt integration by parts 
Int 
= O(InT) + fo (= ) ae 
pnt-step2 = O(n T) T O( ((In ry ‘= O((In ieee (33.15) 


Putting these two estimates together, 


xP 1 

—|< O(z")+ max (|x|) — 

Isini<r P 2<I9@)I<T "oe S@yjct IPI 
< O(a") + O (xe ‘aT (1 nT) ) .xrhorho (33.16) 


Combining with Theorem |33.4.1| and setting T’ = eV™® (so that rewmT = 7), We get 


p(x) —z| =O («" 4+ ve nt (InT)? + oe 


=O (x" + ce VE Ing + x(In ae) 
= O(xe~CV™*), 
for some C' > 0. This shows 
psiasymptotict)(z) = 2 + O(ae °¥™*), (33.17) 
Finally, we extract the asymptotics of 7 from the following. 


Lemma 33.4.2:  partiatsum-pi We have the following estimates: 


(2) = 92 4 Pu, +o), 


Inaz y(Iny)? 
w(x) = 7(x) Ine — is an dy + O(a? Inz). 


Proof. Define 


(1 nm prime, On n, 1 prime, 
y(n) = . Ay(n) = j 
0, m not prime, 0; nm not prime, 
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and 


First note 


W(z)-w(x)|= >) > np 


2<r<logs(x) p| p’<x 
1 
> ar lna 
2<r<logs(x) 


O(x2 In xv + 3 (In r)”) — O (x2 In Lr) +psi-psil-estimate 


IA 


Part_1: By partial summation 3]3.7.1 with u = Ay, U = yy, and v = — 


1 
Ing? 


Combining with (33.18) gives the result. 


Putting (33.17) into Lemma [33.4.2 


x re CVine © 1 ge Cyr" 
eg ea ing +O — a (oa +O (Sor )) dy + O(x 


= li(z) + O(xe~CV™*), 
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5 The Riemann hypothesis 
The following conjecture is worth one million dollars: 


Conjecture 33.5.1 (Riemann hypothesis): All nontrivial zeros s of ¢(s) satisfy Its = 5. 


Note that for no ¢ > 0 has it been proved that all zeros satisfy Rs < 1—e. Our zero-free 
region, sadly, has a boundary approaching real part 1 as t > oo. 

One reason that the Riemann hypothesis is important is that it gives a strong error bound 
in the prime number theorem (as well as many other theorems of analytic number theory). 


Theorem 33.5.2: Suppose 5 <6@< 1. The following are equivalent. 
1. ¢(s) has no zeros with Rs > @. 


2. r(x) =li(x) + O(x* Inz). 


3. m(x) = li(z) + O(x***) for every ¢ > 0, where the constant depends on e. 
In particular, the Riemann hypothesis is equivalent to (x) = li(v) + O(x?2 Inz). 


Proof. (1) = > (2): Suppose ¢(s) has no zeros with Rs > 6. Then using the estimate 
in (33.15), we have 


S- 2 <max|x?| >> 
soir PP sinner lol 
< x (InT)?. 
Now take T= «x to find that 
l 2 
jew) 2] = 0 ("an7y? + ZP) 
= O(x°(In2x)?). 
Then using Lemma and (33.1), 
_ (z) e dy 1 
me=Ta tf (Ogg tow) 


=li(z) +0 (aa) + fo (S| dx 
= li(x) + O(a Inz). 


(2) => (8): Item 2 is stronger than item 3. 
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(3) ==> (1): Going the other way in Lemma|33.4.2} 
_ — fe ay) 1 
wus) = 7a) Ine | - dy + O(a? Inz) 
a « dy he ) ef 1 1 ry dt Oy) i 
= ! ! ©) | Ing — ! + O(a? l 
(s u (In y)? ee | Ine “al (Int)? ” y aa ee) 


/ « dy « dy «fry dt | 
= O+e") _ ae 
a ae } Iny y (In y)? mae (/ (lint joy dy 


0 


for any <’ > €. Note the integrals above sum to 0 by integration by parts (u = Iny, 


dv = tay): 
By partial summation, for o > 1, 


-£0) =D Ann 


II II 
| 
meer 

+r 

es 
WD 
3 
ne 
wH 


i (w(x) — x) 278" de. 


O(a? te’) 


The last integral converges whenever o > 0+ €’, so © has analytic continuation to a > @. 
This means ¢ has no zeros for a > @. 
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Chapter 34 


L-functions and Dirichlet’s theorem 


1-func-dirichlet 


1 Outline 


Our goal in this chapter is to study the asymptotics of 
nm(z,a mod N) =|{p<z:pprime,p=a (mod N)}| 


where a is relatively prime to N. We define (x,a mod N) = Yin<x,n=a(moa N) A(”)- 

To study the distribution of primes in the arithmetic progression n = a (mod N), we 
study the asymptotics of w(2,a mod N). However, this does not come from a Dirichlet series 
that we can easily estimate and that has nice multiplicative properties, like w(x) comes from 
¢(z) =II, a (after logarithmic differentiation and extracting coefficients). 

The solution is to write w(xz,a mod N) in terms of Dirichlet series whose coefficients are 


multiplicative. For example, when considering primes p = 1 (mod 4), we consider 


tee He a 1 
i — — 
Page gee ee gs lI ime 
ae ee ar ran 1 1 
Wa) ae a Pee ee gg 


ie 2° oF 8 67F OS p=1 (mod 4) oa p=3 (mod 4) ae 


The multiplicative structure is from the fact that the coefficients come from group homo- 
morphisms (Z/NZ)* — C, i.e. Dirichlet characters (see Definition |13]13.1.8). 
Logarithmic differentiation gives 


rf _ AQ) , (3), AG), A(7) , AQ) 
Ac a ir a er 

ra AQ) A(3) AQ) AZ). AG) 
Aa i a 

eae v A(1) A(5) A(9) 
9 (-Zlx) ~ 7 (5.2)) = 1s = a a ‘on 


Number Theory, §34.2 


Taking the partial sum of coefficients of the last Dirichlet series gives the desired result. In 
general, we can always estimate W(xz,a mod NV) using an average of these L-functions. 

The main steps in the proof are the same, except with ¢ replaced by L and an extra 
recombination step at the end using character theory. The main steps are the following. 


1. Functional equation and analytic continuation for L, Theorem |34.2.5 

2. Product development, Theorem |34.2.6 

3. Estimates on ee and asymptotics on number of zeros N(T, x), Lemma}34.3.1 
4. Zero-free region for L, Theorem |34.3.3 


5. von Mangoldt’s formula }34.4.1 


If we conly cared about bounds for a fixed modulus N, then that’s all there is to it. 
However, to obtain error bounds independent of N, we need a zero free region indepen- 
dent of N (Theorem (34.3.3). While in Theorem we had the luxury of restricting 
to large |t|, here we have to work with small |t], and our resulting region may miss an 
“exceptional” zero. We show there is at most 1 exception (Theorem and prove a 
version of the Prime Number Theorem for arithmetic progressions (Theorem . Later 
we prove a stronger but ineffective bound on the “exceptional zero” (Theorem |34.5.4) and 


obtain improved asymptotics (Theorem |34.5.1)). 


2 L-functions 


Definition 34.2.1: Let y be a Dirichlet character. Define the L function 
L(s,x) = 30 x(n) Rs > 1. 
n=1 n 


By multiplicativity of y, D has a product expansion 


1 
Mex = pp 


Only the factors with p+ N contribute. Note that if y is of level N and y = yix2 with x1 
primitive of level N,, then 


in-termeof-primitivels( 3, x) = L(s, X1) Il (1 —= x(p)p *). (34.1) 
PIN, Pini 


Thus for convenience we can often just prove results about primitive characters. 
By logarithmic differentiation we have 

L (Inp)x(p)p* 

Z(s,x)=—y Pixie” 


JA(n 
L 1—p 8 


a y 
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Theorem 34.2.2 (Generalized Poisson summation): gen-p; Let g be a function Z/NZ — R, 
and suppose f is a C? function satisfying 
f(z), |F(@)| < C+ [alt 
for some Cd > 0. Then 
m ra is: 
X F(Z) 9m) = ¥ Fenjarn). 


meZ nezZ 
In particular, if y is a primitive multiplicative character modulo N, then 


XY xom)s (F) = S0oxt) X -n) Flo), 


where x7 (k) = en, 


Here f(n) denotes the Fourier transform 
fy) =f faye de 
and g(n) denotes the finite Fourier transform 
g(n)= SY) glme 
m (mod N) 


Proof. Consider the function 


F(x) = Yo flw@+m). 


meZ 
Note this sum converges absolutely to a continuous function by the given conditions. Since 
F(x) has period 1 and is continuous, we can expand it in Fourier series: 


ioe) 
F(z) = S- ine 
n=0 


an = - F(w)e-2""® dy = a dX f(a + mer"? dx = [. f(a)e Pr"? da = f(n). 


meZ ee 


Plugging in = 7 gives 
a ‘ si oe 
Bi |= S- finyerrn(#), 
(y) neZ 
Now we calculate 


SA(Z)am)= YL gar (<) 


meZ a(mod N) 
= LY 9a) X faye) 
a(mod N) neZ 
=D flr) YL glayerrn) 
neZ (mod N) 
=¥ f(n)g(n) 
neZ 
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For the second part, note that 


X xm) (=) = X Un) fm) 
= GO. xt) x) F(n). 


We apply Poisson summation to derive a transformation law for generalized theta func- 


tions. 
Definition 34.2.3: Let y be a multiplicative character modulo N. Define 


Note we need to work with V,(u) when x is odd, since in this case 6,(u) = 0 and we 


cannot express L(s, x) in terms of 6. 
Proposition 34.2.4 (Transformation law for @)): teta-transtorms Suppose x is primitive. Then 


tly = So a 
ay(u) = ~SeaM9, (ae). 


is itself; moreover, if f(x) = g(ax) then f(y) 


—ra? 


ay? 


€ uN? , 


Proof. Note the Fourier transform of e 


g (4). Hence 
1 
- —ru(N2a)? = 


By the Poisson summation formula|34.2.2| 


For the second part, note first that fily) = 2nixf(y). Hence 


2 1 d 2 
—mu(Na)*) _ ( _ pe —Tu( Na) —— ‘ 
ite ) ( saw) (= a ) 2tuN NJu 
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Then by Poisson summation, 


From this we get the functional equation for the L-function. The proof is similar to that 
of Theorem [33.2.2 


Theorem 34.2.5: tcontinues Let x be any character modulo N. Then L(s, x) has an mero- 
morphic continuation to C. If y is principal then L(s, x) has a single pole at 1, and if x is 
nonprincipal then L(s,y) is entire. 

Now supose x is primitive. Defining 


§(8; x) = (~) * D (- 5 “) L(s,x); 
where 
_ fo, ifx(-1) =1 
— 11. if x(—1) = -1, 
we have P n 
E(s, ) = 0% a dec _ 8,X): 


Moreover, for any y, L(s, x) has zeros at —2N + a (the trivial zeros) and all other zeros 
are in the critical strip 0 < Rs < 1. 


Note that for y nonprincipal, partial cancellation in the Dirichlet series removes the pole 
ats=1. 


Proof. Note that it suffices to prove all statements for x primitive, in light of (34.1). If x is 
principal, the result follows from the result for ¢, so suppose y is nonprincipal. Use partial 
summation to find that for for s > 1, 


bousd-ti-eummandel (3, xX) = S(a) sxe! dx (34.2) 
1 


where S(x) = Yn<» x(n). (We use the fact that limy_,.. S$(N)N~* = 0 when s > 1.) Since 


x(1) +--+ +x(N) = 0 by Corollary }13}]13.1.7) y(1) +---+ x(n) < N. Then for Rs > 0, the 
above integral converges absolutely, extending L(s, x) holomorphically to Rs > 0. 
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Case 1: Suppose y(—1) = 1; then y(—n) = x(n). We calculate 


oo s du 
[ Oy (uu an 


in two different ways.! When 0 < Rs < 1, 


Now using the transformation law |34.2.4| 


- sdu re G(x, x) ( 1 ) ; du 
i} ae ah NVJu Ox Nu) wu 


_ BOX 1 ) sidu 

~—N i ax ( - 

— 2G (x, xT) = of mn? = 
N 


= SENT ine fe af 2 du mn? 
=P EY [Rae (=a) aC oe 


2G(x,xf)m2-2 Xn) peo _, iss du 
= Ns 3 aa eu? " 


2 ee = 
= COoxis Lil-s,x)r ( *) . 


Equating these two calculations gives the result. 
Case 2: Suppose x(—1) = —1. We work with J, instead of 6,. To compensate for the extra 
factor of n in Vy, we need an extra factor of u2. We calculate 


Unlike in Theorem |33]33.2.2| there is no “—1” since x(0) = 0. 
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in two different ways. First, 


=23° [ xnyneu  —nx(—n) = ny(n), x(0) = 0 


= Qn? L(s, paik ( 


Now using the transformation law |34.2.4| 
oo du )iy 1 ) s+1 du 
a a = 
I u ie en € Qu) ° u 


_ Cex a 1 \ 3 idu 
a N?2 iy 0: (sa) 4 u 


2G (x, x7)é 
ao 


Again matching the two calculations gives the result. 

From Proposition 5), T has no zeros, so we find that L(s, x) is defined whenever 
L(s,X) is defined; this L is entire. The description of the zeros of L follow from the functional 
equation and the fact that [ has poles at —Np. 


Theorem 34.2.6 (Product development of €(5,)):. x-chi-product-development SUPpose y is primi- 
tive of level N > 1. The function £(s, x) is entire of order 1 and has the product expansion 


€(s,x) =€(0,x)e"* (1-2) ef. 


p zero of E(s,x) P 


Then 4 Vis, x) has the partial-fraction expansion 
/ 


L 1 N 1’ s+a 1 1 
= =B ] — Ws 
p (2x) +5m(7) ral 2 )+ p> (<+5) 


p nontrivial zero of ¢ 
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From now on, we only talk about nontrivial zeros of ¢. 


Proof. We proceed as in Theorem |33]33.2.5,| The argument is the same, the only major 
differences being that €(s, x) has no poles at s = 0,1, and the slight difference in definition 
of ¢(s, x) in terms of L(s, x), versus the definition of €(s) in terms of ¢(s). (Namely, we have 
_ sta 
s +a instead of s, and an extra N~ 2 . For completeness we give the proof. 
To show it has order 1 we need two inequalities. 


Step 1: There is no constant C' so that €(s,) x e@!*!: Indeed, for real s and any constant 
C’ we have 


Step 2: There is a constant C so that €(s,y) X eClsimls!; els!ls| > 1 for all s so it suffices 
to prove this for sufficiently large s. By the integral and sum formulas for I’ and €, and the 
fact that |x*| = |x|, we have 


ota 


(otis (=) 7 r( 


o+a 
2 


) aes, o> 1. 


By symmetry of € is suffices to consider o > $. (We have €(s, x) = Soe E(1 — s,X), and 
the multiplier has absolute value 1.) Consider 2 cases. 


l.o > 2: Then w= < 1 and L(o,x) < ¢(2) so we have by Stirling’s approxima- 
tion |31]31.7.4] that 


|E(o+ti, x)| x Nee (a) _ nF eldnry(o+a)] — yy $2 ¢(Zts=2) im 4-24.40) 


from which the result follows. 


2: 5 <o <2: For s bounded away from 1, from (34.2), 


L(s,x) = O(|s)). 


This time T (4%) = O(1) so 


IL(s,x)| < = O(|s|) x eClslin|s|_ 


This shows €(s) has order 1. 
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Step 3: By the product development |31.6.3} noting the the zeros of €(s, y) are the nontrivial 
zeros of L(s, x), we get 


€(s,x) =€(0, xe" TT (1-2) ef. 


p zero of L(s,x) B 


Logarithmic differentiation gives 


sta 


Since L(s, x) = (=) 2-T— (=a) &(s,x), we get 


/ 
+ (sx) =5n(™) - 55 a 


3 Zeros of L 


Lemma 34.3.1: weak-t-zeros Define £ = In N(|t| + 2). Let x be a primitive character of level 
N. For s=o + it with o € [—1, 2], we have 
[' 1 1 
(6.x) =D (5 +5) +00) 
L a eR op 
3 1 


|3(s—p)|<1 2 ~ P 


+ O(£). 
Moreover, there are O(In|Nt]) zeros p with |S(s — p)| < 1, i.e. the number of zeros with 
imaginary part in [t,t + 1] is O(In Nt), as t > oo. 

Note this gives N(T) = O(T In(NT)). 


Proof. We follow the proof of Theorem |33]33.3.1| The case N = 1 follows from there so we 
assume NV > 1. 
Step 1: Theorem |34.2.6] gives us 


ves 1 N\ 1I’s+a 1 1 
-zero-sum ~—_ 5 = B ] ( ) —_ = . 34.3 
L2 7 (8x) rae - aT 5 i ara) (34.3) 
ee n—=_S_-_“_Y 
O(1-+In N) (A) as 
From Stirling’s approximation |31}31.7.4| (A) equals 
ota t 


+ O(1) = O(L). (34.4) 


Lneatiniad-estimatel | 


T 5? 
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Now suppose s = 2+ 7t. Note that 


—(3,X) < < OO, 


i _ 
7 _ 


: x(n)A(n)n-2-# 


Yn n)n~? 


so the LHS of (34.3) is O(1). Hence (34.3) becomes 
1 1 
La-uero-wumal) (L) = S> ( + *) . (34.5) 


a \e=p Pp 


Now finish the same way as in Theorem |33]33.3.1]to conclude the first step. 


Step 2: Now we consider general s = o + it, by comparing it to 2 + it. We have by (34.3) 
and (34.4) that 


ye L' : 1 1 (2—0)+it 
—(s,y)-—(2 + it) =O(1)+ S- S> —_—— + S- : : 
L ens e—picr SP ja(erplca 2 FHP jayerpyar (S$ — IZ + tt — p) 


O(1) 
O(L) O(L) 


Finish as in Theorem |33}33.3.1| the only difference being that In |t| is replaced by In| N¢]. 


Theorem 34.3.2 (von Mangoldt): tzeros(*) As T + 00, 


T. (NT\ T 
N(P,x) = = In ( ~ ) -—+0(nNT), 


where the constant is independent of N. 


Proof. The proof is similar to Theorem |33]33.3.2| We'll only need the weaker estimate 
N(T, x) = O(T In NT) so we omit the proof. 


Theorem 34.3.3 (Zero-free region for L): t-zero-tree There exists a constant c > 0, independent 
of x and N, such that the following holds for all primitive x of level N. 


1. If y is nonreal, and s = o + it is a zero of L(s, y), then 


L-zero-boundO < 1 — . (34.6) 


2. If y is real, then with at most 1 exception (counting multiplicity), all zeros satisfy (34.6). 
If it exists, the exceptional zero is real. 


Unlike in Theorem |33]33.3.3} we have to worry about small |t|. Fortunately, L(s,y) has 
no pole at s = 1 to screw us up. Things are not so easy, however. 


ATA 


Number Theory, 834.3 


Proof. We may assume N > 2. 
As in Theorem |33]33.3.3| we have 0 < 3+ 4cos@ + cos 26, so 


0 <3 + 4R(X(n)n™) + RO(n)'n™). 


Multiplying by A(n)n~? and summing, we get 


L L L’ 
java Hiestiinequality) < 3 (-Z0. .0)) ++ AR (-Z0 -+ tt, 0) ++ RK (-Z0 -++ 2ti, °)) ; oO = 1. 


(34.7) 
Suppose 1 <o < 2 and p= (1 —6) +i is zero. First we have 
L! ¢! (Inp)p? 
2flil — — = —- — = + O(In N). 34.8 
e ZT, (% Xo) ¢ (7 Xo) x 1_-p aa (In N) ( ) 
Next, we use the partial fraction decomposition |34.2.6| By Theorem |34.3.1] we have 
L' 
oniodt (-Z6, xX )) <0 “oor (— =). (34.9) 
1. Suppose x? is not principal, i.e. y is not real. Now (34.9) gives 
n ( eT + ti ) m OLE) : (34.10) 
i eS o+6-1 
Also by Theorem |34.3.1 
L' 
pro eetnoeli Nt (-Z¢ + 2t2, °)) < O(L(2t)) = O(L). (34.11) 
The remainder of this case follows the lines of Theorem |33}33.3.3 
2. If x? is principal, then we have 
! 3 C p —(o+2ti) 
az lo + ati, x") = pe eae) +S oInp- 1 — po or2t) 
p|N ‘ y 
O(1) when o>1 
v ( af + 2t1 *)) < 00 (le +2)) + ( : ) +00 N) 
-zero-free-eq| eT 5 oS ‘Ft pe. pe ge n ; 
Puente Ne, ye eee - (o + 2ti) —1 
(34.12) 


the last inequality following from Lemma ]33]33.3.1 


Putting (34.8), (34.9), and (34.12) into (84.11) give 
a i 
< ye —4 a —— { { O L ) L-zero-free-eql 
0< (Sy t0Ke ))+ (- ae (7) +00) (leery, (¢) 


(34.13) 
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Fix C’ > 0; when s =o + it and |t| > {> then —+— = O(InN) so (34.11) holds and 


we proceed as in item 1. 


Hence we consider t < 7 We use a different approach. Note 


/ / 
—F (% Xo) 2 7 (@X) when og > 1 


because the a their coefficients are A(n) > —x(n)A(n) (and they are real)?. 


Putting in (34.8) and (34.9 = give 


2) R ee =| + O(InN). (34.14) 


Cie aa 
Leto =1+ 2%: we estimate the sum in terms of the real parts of o — p. For any zero 
p we have 
6 1 26 
ro eg _ 
S a a din = *e p) 
L-noro-free-eq2 | = p\? = = [Sa — pal + [Rio —_ py)? (34.15) 
1 5 
= € + 1) R(o — p)*? = gite — p)’. (34.16) 


Hence (34.14) gives, for some constant A, 


1 
ee > 
(+55) mn —+AlnN> > n(——) 


S (p I< <Ey 


by (84.15). 


IV 
M 
Or 
M 
2 
+ 
x 


4 InN 


ore VO the RHS sum. If there are two zeros 


If R(p) 
(counting era then 


8 1 1 
< A4 . 
526+¢7 26 


This would be a contradiction if 


26(3 — 10A0d) 

5(26A +1) © 
Now choose 6 small enough and c so that it works for case 1 and satisfies the above 
inequality. 


c< 


Finally, note ¢(S, v) = ¢(s, x) for real characters, so if s is an (exceptional) zero so is 
s. Since there is at most one exceptional zero, it can only be real. 


? Alternatively, put in t = 0 in (34.11). 


476 


Number Theory, $34.4 


4 Prime number theorem in arithmetic progressions 


Theorem 34.4.1 (von Mangoldt’s formula): t-von-Mangoidt-formula For integer x > 2, x > T, and 
x primitive of level N > 1, 


w(z,x) a = 


a O (Hee 
Is(ai<T P 


: (In a | 


If y has associated primitive character x;, then for x > 1, 
|Y(z, x) — ¥(z, x1)| = O(n N Inz). 


Note that unlike in Theorem we have ~(x,x) & 0 as opposed to (x) & x. 
Remember this is expected because the average of values for a nontrivial character is 0, so 
there is cancellation in the sum. Moreover, there is no pole at s = 1 for L as there was in ¢, 
so the application of Cauchy’s Theorem in Step 2 will not give the x term. 


Proof. Step 1: We estimate u(x) using Theorem 32]32.4.2 Suppose x is an integer; the 


theorem gives 


ted (fo = (Hea) 2) 


n>1,nAax n In ( ) 
= eye In(n) 
<I] -- — ; 
< In(z) es (=) T |In (2) 


The difference is O (mF ) exactly as in (33.13). 


Step 2: We move the line of integration to #s = —1. Assuming that T is not the imaginary 
part of any root, by Cauchy’s theorem 


ctHiT 7% [! —14+iT 7 L/ 1-iT 7% L! c-iT 7% [! 
[ —(s,x)ds+ ax)de (s,x)ds+ ff —(s, x) ds 
e——___—_—____ ——_—___—___ 


ir s L cHiT a. wir os L irs L 
Ina Ly Th,2 


sO 
c+iT 78 L’ 
rvonata f a (-20) ds = Th oe Tho ay Ls = 
c-iT § L 


We estimate each summand. 
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1. For the horizontal integrals, we use the estimate |34.3.1|to get 


+O(n NT), s=o+Ti 


3(s—p)|<1 9 — P 
1 


e—picr S18 — P) 


< + O(n NT). 


We would like to bound S(s — p) away from 0. To do this, note that for |T| > 2 


large there are O(In NT) roots in with Sp € £[T,T + 1] by Lemma 84.3.1} Hence by 


tweaking T slightly we can assume |S(s — p)| > Raat Also by Lemma |34.3.1| there 


are at most O(In NT) terms in the sum, so the sum is O((In VT)?). Integrating gives 


-1+Ti 78 [! 
J, wpexds 


+Ti 


= O((n NT)*)O (=) [- Le*| ds 


=0 (aN. 


2. For the vertical integral, we use the same estimate, this time noting that |s — p| > 1 
for every nontrivial zero p, since Rp > 0. This gives that “(s) = O(In NT) and 


-1-Ti 7 J -14Ti 7-1 
/ anae eo O(n NT) [ ds 


uri 3 L ire. || 


_ Gee = =i oe | 


3. Note by Lemma }34.3.1|that 4(0, y) = O(L) = O(In(N + 1)). 
is 


Step 1 and (34.18) together with the above estimates give the first part of the theorem. 
For the second part, note that 


p(x, X1) ~~ p(x, x) = x (x1 (7) — x(n) )A(n)n~ 


l<n<z 


< bY An) 


1<n<a,n=p", p|N 
] 

cE lbs 

pin LP 


< S"InzInp =lIng in. 
p|N 


Theorem 34.4.2: ony-1-char There is a constant c > 0 such that for any distinct real x; and 


X2 to moduli N; and No, at most one of L(s, 1) and L(s, x2) has a zero 6 > 1 — nN ND)" 
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Corollary 34.4.3: oniy-i-char2 There is a constant c > 0 such that the following holds: Fix a 
level N. There is at most 1 character y of level N such that L(s, x) has a zero with 0 > 1—4. 


Proof o Theorem|34.4.4 The product y1x2 is a character with modulus N,N. By Theo- 
rem |34.3.1} Lo, x) < O(In N,No) for 1 <0 < 2. Let 


F(s) = C(s)L(s, x1) L(s, x2) L(8, x1X2). 
Then by logarithmic differentiation, 
-F (6) = -£66) — Flo) — Flo 2) = Flo xn) 
= So(1-+ xalr) + xan) + x(n) x2(m)) Aon) 
=3"(1 + xi(n)\(1+ xa(n))A(n)n-* > O (34.19) 


onechar-eq (34.20) 


3 
ll 
mn 


since the coefficients are nonnegative. 
Suppose 1, G2 are exceptional zeros of L(s, 1), L(s, x2); then putting Lemma [33.3.1 


into (34.20) gives 
1 1 1 
O(In N,N. - — ae 
(In NyN2) + ea v= =e 
Let 6 = min(1 = B41, 1- £2). Take o0 = 1 + 206 to get a a O(In Nip), i.e. 0 = In N, No with 
constant independent of N,, No, i.e. there is an appropriate choice of constant so that 11, y2 
are not both exceptional for level Ny No. 


Proof of Corollary|5.2| Fix a primitive character x of level N. Suppose x’ is of level N, 
whose corresponding primitive characters has level N’. aca the mores gives c such that 
at most one of L(s, x’) and L(s,y) has a zero 8 > 1 — >1- 


nN inn’ 


Theorem 34.4.4 (Prime number theorem in arithmetical progressions): puta» Let C > 0 and 
suppose x > e"™%)*_ If there is no exceptional zero for level N, there exists C’ > 0 such 


that li(2) 
m(x,amod N) =(14+0O oo MD 
)= (1+ Oe) 
If there is an exceptional zero 6 of level N with associated character x, 
1 ' 
n(x,a mod N) = ——~(li(x) — x(a) li(x®) + O(xe~C'V™*)). 
y(N) 
Proof. We have by column orthogonality |13]13.1.6] that 
1 1 
paismod-chit) (X, a mod N) = S> x(n) ACN) = S- y(N) S- X(a)x(n)A(n) = p(N) 
n<«x,n=a(mod N) n<ax x€(Z/NZ)* Y 
(34.21) 
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Letting y; be the primitive character associated to y, by Theorem |34.4.1) we have 


( xP z[(In x)?-+(In NT)?] 
psi-chi-estimateW (a x) = 7 dp zero of (x,x1) p | O ( T 
v(x) + O(n N Ina), y trivial. 


(34.22) 


+ In N In x) , xX nontrivial 


‘ xP: 3 
We Calunale 3) cencxdepional era arta) = in two steps. Assume T' > 2. 


1. By Theorem |34.3.3} there is a constant c such that for all |S(p)| < T, 


elnz 


ser | — 7% <a gi WNT = ye mNT 


2. Note the zero free region in Theorem]34.3.3]/means there is a constant do, independent of 
N,x, so that for all nonexceptional roots p, |p| > dp. Hence using N(T) = O(T In NT) 


(Lemma or Theorem |34.3.2), 


1 1 


soir ll ~ js(per Max(S(e); do) 


T dN(t) 
S I max(t, do) 
—, AVE) T N(t) 
~ max(T, do) i, ie o 


(Riemann-Steltjes integral) 


integration by parts 


= O(In NT) +f 0 (=~) dt 
= O(In NT) + O((In NT)*) = O((In NT)?). 


Putting these two estimates together, 


s@let [PI 
< O (e787 (In NT)?) . 


|S(p)|<T, p nonexceptional Pp 


Combining with Theorem |34.4.1} setting T’ = eVine and using N < eCVne we get 


clna ] 2 ] NT 7 ] T 2 aP gi B 
je(e.x) = 2 = 0 (we Fn ry? + HO) a a ) ) zi 


= O (xe F*(C +1)? ne + e-¥™((In.2)? + (C +1)? Inz) + C(na)?) —* 


—Cvine) 


psi-chi-asymptotic == O(xe 3 


3Here “nonexceptional” means with respect to level N. 
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for some C; > 0 independent of N,x, where the implied constant is independent of N, y. 


For the trivial character, (34.22) and (33.17) give 


psi-chi-asymptoticap(Z, X) = 2+ O(xe7@¥™* + InzInT) = x + O(ze-@2¥™") (34.24) 
Using (34.21), (84.23), and (34.24), we get 
1 x(a) xP Sfalins 
wv(x,a mod N) = (« —_ + O(ae“3V""*) 
e(N) B 


where the grayed-out portion appears only if there is an exceptional zero. (Note this can 
happen for at most 1 character by Lemma 34.4.2]) It remains to transfer the asymptotics of 
w to that for 7. 

The same argument as in Lemma 33.4.2|shows that 


w(z,amod N) | fr dy 
Ina | I ( ny? 


m(x,a mod N) = 


giving the estimate for 7. 


5 Siegel zero 


sec:siegel-zero In this section we obtain bounds on the exceptional zero to get a better error bound 
for prime number theorem on arithmetic progressions. We proceed in 2 steps. 


1. Show that L’(3,x) is small for 6 close to 1. 


2. Bound L(1, x) away from 0. 


From this, we get that L(G, \.) cannot be 0 for 6 too close to 1. 
Then we will be able to show the following improved form of Theorem |34.4.4 


Theorem 34.5.1 (Siegel-Walfisz): Given any C’ there exists a constant C’ depending only 


on C' so that 
li(a) 


y(N) 


m(x,a mod N) = Ope 7 ee") 


whenever 
Nang), 


Unfortunately, this bound is ineffective; the proof does not give a way to compute a 
suitable value of C’. 
Of course, if the Riemann hypothesis were true then it would solve all our problems. 


Theorem 34.5.2: If the Extended Riemann hypothesis holds (all nontrivial zeros of L(s, y) 
satisfy kts = 5)s then 

li(a) 
e(N) 


for c > N?, where the constant is independent of N. 


n(x,a mod N) = + O(«?(Inx)?) 
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5.1 L’'(G,x) is not too large 


Lemma 34.5.3: tem:t-not-large There exists an absolute constant C such that 
|Z(o, x)| < C(n NY? 


for any nontrivial Dirichlet character x modulo N and any o with 1 — es aon L 


Proof. Because L(o, x) = 72, xin) , by Proposition 32}32. 2.4} we can simply differentiate 
term-by-term to get 


3 x(n) Inn 
L (0, x) = ne : 
n=1 


Now we bound this sum by breaking it up into two parts. 
First note that for n < N, we have 


Hence i ; ; 
= yi? < ia = 
n n 


Step 1: We bound the sum from n = 1 to N. By (34.25), 


(34.25) 


eq:siegel-zero-1 


So 


eq:siegel-zero- 


al n)Inn 
Le 


for some C;. The last step follows from estimating using the integral fj‘ oe d= 4(InN)*. 


Step 2: Now we consider the sum from N +1 to oo. Let U(n) := Yyep41 x(m) and v(n) = 


Inn 


2. By partial summation 313.7.1} we have 


x inn = jm vu F x U(n — 1)(v(n) = u(n = ») 


Since v(n) decreases to 0 and |U(n)| < N (as SHY"! y(n) = 0 for any k), the first term 
goes to 0 and we get the bound 


oo 6 x(n) Inn InN 
eq:siegel-zero-3 d a < Nv(N) = a ae < N(n N)= N =e In N. (34.27) 
n=N+1 


where in the last step we used (34.25). 
Adding (34.26) and (34.27) together gives the desired bound. 
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5.2 L(1,x) is not too small 
Theorem 34.5.4 (Siegel’s inequality): cxcept-zero For each ¢ > 0 there exists C, > 0 such that 


L(1,x) > C.N7€ 


for all real Dirichlet characters y modulo N. 
Thus there exists Cl > 0 such that any real zero 6 of L(s, x) satisfies 1-6 > CLN~. 


First we prove the following lemma. 


Lemma 34.5.5: tem:tichi-not-smaii Let v1; and x2 be real primitive characters with modulus N, 
and Np, let 


F(s) = ¢(s)L(s, x1) L(s, X2) L(s, X1X2); 
and let 
A= L(1, x1) LC, x2) LC, x1x2)- 


Then the following inequality holds: 
8(1-s) 7 
F(s) > 5 — 7, (M2) : g<e<h 


Note the technique of getting information about a L-function of a single character by 
looking at F'(s)—a function defined using two characters—is a lot like what we did in showing 
Corollary ony-1-cha-2 using Theorem |34.4.2} We’ll comment more later on why we looked at 
F(s).4 


Proof. The main idea is to expand F'(s) in power series and bound its coefficients (equiv- 
alently, bound the derivatives of F'(s)) using the inequality from Cauchy’s formula, orol- 
lary |81]31.4.6 

We have 


In F(s) = In¢(s) + In L(s, x1) + In L(s, x2) + In L(s, x1x2) 


4A deeper reason why we often look at F(s) is that it is the zeta function of a biquadratic field. Thus 
we can prove nice facts about F'(s) by combining algebraic and analytic theory. We’ll give proofs that don’t 
require this knowledge. 
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This means In F'(s) is a Dirichlet series with all coefficients positive. Because the power series 
of e* has positive coefficients, this means that F'(s) also has all coefficients positive. We’re 
allowed to substitute any absolutely convergent series into a power series. (Is this right’) 
Suppose F'(s) = 7°-, ce 

Now we expand Fs) in Taylor series at s = 2. (We can’t do it at s = 1 because F’(s) 
has a pole there.) We have 


F(s) = Yo an(2— 3)", an = (1) —_ 


We calculate the coefficients using }32/32.2.4]and get 


°° f(n)(Inn)™ 
ee )(Inn) 


yO. 


Am = 


n=1 


In particular, form = 1 we have a,, > 1 since f(1) > 1. It’s 4. 
Because we know F'(s) has a pole of residue 4 = L(1, x1) LC, x2) £(1, x1x2), we consider 
the function 


A _ (5) = 5 7 > (am — A)(2— 8). 


m=0 


Let 2 be the circle of radius 3 (not its interior) centered at 2. Then for any y of modulus JN, 
|L(s, x)| < CiN for some C}, for all s in a bounded region away from 0 because by (34.2) 


Ls, =|f S(e)sa'dz| < N [Isa de, Sa) = Y x(n). 
nN<ax 
Therefore, 
ea:siegel-aero-a| I"(s)| < (C1.N1)(CiN2)(C iN, No) = Cy(N, No)’, Cy = Cc. (34.28) 
and for s € Q, 


eq:siegel-zero-4 


A 
(4) < 2L(1, x1) L(1, x2) L(1, x1x2) < 2C2(NiNo)?. (34.29) 
Now we use the inequality from Cauchy’s formula, Corollary |31/31.4.6} to get 


1 2 
fig Nh < eee tre) — CNN, 
ZzEQ re 


(3) : 


To bound F(s) — A, = %_o(bm — A)(2 — s)™ when 7 < s < 1, we first bound the sum 


s—1 m=0 
from some M (to be determined) to oo. 
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Firstly, 
S> lam — Al(2— 8)" < S> C3NPN5 |=(2 — 8) 
m=M m=M 3 
paaet 4 8 
3 M 
< CyN7 NG (5) 
<C.Ni Noe eV/4 = 0.78. 


We choose M so that C,N2N2e~4 € [se-4, 3]. Note the lower bound rearranges to 
M < 8InN,No + Cs. Then because the coefficients a,, are all nonnegative, we can drop 
some of them in the inequality to get 


d M-1 
F(s) — 7 21-AL 2-8)" — CaNPNGe MM 
Ss — m=0 
Sit (= ee C,N2N2e-% < 2 
T= 2? 44V +9 = 
1 d 
— F(s)>=- — 5)” 
(8) > 5-~—2-8) 
a r ete), e<ld+z 
2 1-s 
1 Cer 
a. (Mn), M < 8InN,No+ Cs. 
=.S 


This finishes the proof of the lemma. 


Proof of Theorem|84.5.4, Fix « > 0. We want to choose x; so that 0 > F'(s). Consider two 


Cases. 


1. For some y, L(s,y) has a real zero in the range (1 — ee 1, Then choose y; to be 
this character and (3, to be this zero. We then have F'((,) = 0. 


2. Else, let x; be any primitive character and (}, € (1 — 6) 1), Note the following: 
e In this case there are no zeros for any L-function in (1 — io) i so they all have 


the same sign as their value at 1. The value at 1 is nonnegative (in fact, positive) 
because the product expansion gives that the L-function is positive for 0 > 1. 


e ¢(s) <0 for0<s <1, and 


Thus F'(6,) < 0. 
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In either case F'(3,) < 0, and the choice of 6, depends only on ¢. From Lemma/}34.5.5} we 
now get the inequality 


1 Cr 
< 
a Dy 
A> Cz1(N,N2) 80-1) 


(N,N2)80-2), 


for some C.,, depending only on ¢. Now we also have an upper bound for A: 


A= L(1, xi)L(1, x2)L(1, X1X2) 
< (Cy In N,)LQ, x2)(C1 In N, No). 


Now suppose that Nj > N,. Combining the two inequalities and noting that In N, is a 
constant depending only on ¢ and is less than In No, we have 


L(1, X2) > C.5N, OP) in N)~* 


> CzNy? (In Np)? 
> Cz3N. 


By choosing the constant to be smaller, we may ensure that this bound also works for 
No < Nj. 

Finally, combining Lemma [34.5.3] and the bound L(1, x) > C:N~* immediately gives the 
fact that any real zero of L(s, x) must satisfy 8 < 1—CLN~. 


Note that it was essential to work with F'(s) rather than G(s) = ¢(s)L£(s, y): Something 
like Lemma |34.5.5| would go through, but if we used G(s) then G(s) may have a zero close 
to s = 1 so we don’t know the region where G(s) is nonpositive, and we may have to take 

1 


s = 6, arbitrarily close to 1. This kills the proof because of the term at When we work 


with Fs), the case where there is a zero close to 1 is dealt with nicely. 
5.3 Proof of Siegel-Walfisz 


Proof of Theorem|34.5.1,| Suppose there is an exceptional zero 3. By Siegel’s inequality|34.5.4| 
for any ¢ > 0 we have 


B-1<-C.N~. 
The prime number theorem in arithmetic progressions |34.4.4] gives 


1 
y(N) 


m(z,a mod N) = (li(x) — x(a) li(z®) + O(xe~C'V™*)). 
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We show that li(x’) gets absorbed into the O term. Indeed, we have 
goCeNe 2 eC Ving 
2 (Ine Cay =e vilnwy 


<— vinr> oN 


E 


1. 
Cae 
=> (=) (In x) 2° EN. 
Now given N < (Inz)°, choose ¢ = rae For large enough C’, the equivalences above give 


= 1. / 
gp Ree oO Wine oT heretore, 


p-1 
‘fee ic ies ee Ap ON _O' Vine 
nero Gan O(a- ax )=O(ee ) 


for some C’ > 0, as needed. 
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Chapter 35 


Special values of L-functions 


O Introduction 


0.1 Points of interest 


* * eq:acnf-exl 


* * eq:acnf-ex2 


* * eq:acnf-ex3 


00 1 1 1 
= Dye Bn +2 5n+3 °° 5n+4 


n=1 


See Mazur’s article in PCTM [?]' for an introduction to algebraic number theory that 
highlights these formulas. 


In this article we explore a formula that relates analytic with algebraic quantities: values 
of the zeta function with the class number, regulator, and discriminant of a number field. 
The main theorem is the following. 


TAvailable online athttp://www.math.polytechnique. fr/~chenevier/MAT552/barry_ 
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Theorem 35.0.1 (Analytic class number formula): tim:acar Let A be a number field. 
Then 


2 On ig 
eq:acnfl Ress=1 Cx(s) = ( - 7 PEK (35.4) 
K|VK 


where hx is the class number, Reg, is the regulator, wx is the number of roots of 
unity in AK, Ax is the discriminant, and 171, 2r2 are the number of real and complex 
embeddings of Kk. 


For prerequisites on the zeta function, see |http://web.mit.edu/~holdenl/www/ 
math/analytic-nt.pdf, For prerequisites on the class and unit group, see Chapters 3 


and 5 of http: //web.mit.edu/~holdenl/www/math/ant.pdf 


0.2 Road map 


Our plan is the following. First, we grok what’s going on by looking at the class number 
formula for quadratic fields, which already show all the different types of behavior. To prove 
the theorem for general K we need to get reacquainted with the embedding 0 : Og > R'xC*. 
Second, we’ll massage our formula into nicer forms for quadratic fields and for cyclotomic 
fields, and see what algebraic information we can milk out from the class number formula. 

How does this formula come about?? Cx(s) is a sum over ideals of Ox where the ideals 
are weighted depending on their norm; we are “counting ideals” with appropriate weight. 
The growth of Cx near s = 1 depends estimates for the number of ideals with norm < r. An 
expression for the analytic quantitiy Res,—; ¢x(s) then comes from combining the following 
geometric and algebraic information. 


1. Geometric: We count algebraic integers in kK. Geometrically (using the embedding c), 
they form a lattice in R"’ x C*®. We can estimate the number of points in a given region 
around the origin. This depends on the embedding, on the volume of a fundamental 
parallelotope of the lattice. This is how we get the quantity* 


21 (Qm)" 


|Ax| 


2. Algebraic: Note ¢x(s) is a sum over ideals, not over numbers in Ox. So we need to 
multiply by a factor that tells us what we’re off by in considering numbers rather than 
ideals; we can do this because of the multiplicative structure of Ox. This will depend 


Right now the section on regulators is missing; see/http: //en.wikipedia.org/wiki/Dirichlet’ 
s_unit_theorem#The_regulator 


°We follow the discussion in (http://math.stackexchange.com/questions/292104/ 
how-to-derive-the-class-number-formula 
“The grouping of the 27 is actually a little misleading: volume calculation gives the 7”? (think circles); 


think of the 2"? as grouped with the JlAkl. 
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on the units (the fundamental units and the roots of unity) and the class number. This 
is how we get the factor 


hx Regx 
WK ; 


0.3. First explorations 


Make sure you remember the following. 


1. How to show Res,-; ¢(s) = 1. (Used a “summation by parts” argument. The 
same argument analytically continued ¢(s) from Rs > 1 to Rs > 0.) 


2. The embeddings used to prove the finiteness of the class group and find the 
rank of the unit group. The definition of the regulator. 


Problem 35.0.2: Recall that the zeta function for Q can be defined as 


(1 
prime p 


for s > 1. How do we modify this definition to define the zeta function for an arbitrary 
finite extension K/Q? 


The identity above relied on unique factorizaiton. In general we only have unique factor- 
ization of ideals, and we measure the size of an ideal with the norm. So we define 


and find by unique factorization of ideals that it equals T],co, prime (1 — sur) 


First we explore the class number formula for quadratic number fields. 


>We always restrict to nonzero ideals 
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Problem 35.0.3: — 1. Consider the zeta function for K = Q(i). When we expand 
out ¢x(s) = ©, %, what do the coefficients a, represent? 


. Evaluate Res.—1 ¢x(s) (recall how we evaluated Res,—) ¢(s)), using some area 
calculations. 


. Can you write ¢x(s) in terms of L-functions you are familiar with? De- 


rive (35.1). 


. Do the same for Z[,/—2], Z [] and Z[/—5]. What’s different about the 
last case? 


. Do the same for Z[V/2] and Z [4). (The argument is more complicated, but 


make a guess.) Derive (35.2) and (35.3). 


. Conjecture the general class number formula (we already told you, but don’t 
look back). Try to generalize your arguments for quadratic fields to prove it. 


. We saw that in (35.1) through (35.3) that we can evaluate in closed form series 
of the form }) +- (where the + are periodic). Can you do this in general? 


Given a character y : Z/N — {—1,1}, evaluate in closed form 


y xin) 


n 


in 2 ways: algebraically, as in (35.1) through (35.3), and analytically (without 
mentioning algebraic quantities). Try to simplify your formula as much as 
possible. What happens when you equate these two? 


Hint for the analytic expression: (a) power series are nice, (b) © + reminds us 
of logarithms. However we seem to have nasty sums such as >> 
What’s the technique? 


il 
n=m (mod p) n° 


For a leisurely account of the class number formula for quadratic fields (assuming few 
prerequisites) see the PROMYS notes [?]. We will discuss 1-5 in Section [I] 6 in Section [1.1] 
and 7 in Section 2] 
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1 The analytic class number formula 


sec:acnf 


1.1 Quadratic fields 
subsec:acntgt This section will be mostly to build intution for the general case; we will give a 
self-contained proof of the general case in the next section. 


1. We have ¢x(s) = Dacig FEF = Lnw1 2 Where 


dy, = the number of ideals of norm n. 


Now we specialize to the case K = Q(i). Every ideal is principal, of the form (a+ bi), 
and its norm is a? + b?. So does a, count the number of solutions to a? + b? = n? 
Almost: multiplying by a root of unity gives the same ideal. (a+ bi), (i(a + bi)), 
(i?(a + bi)), (#3(a + bi)) represent the same ideal, so a, is + the number of solutions. 


We can think of this another way: each ideal is of the form (a+ bi) for a unique pair 
(a,b) € Z? with a > 0 and b > 0 (“in the first quadrant”). Hence a, counts the number 
of solutions 


an =|{(a,b) €Z’,a>0,b>0:0+h =n} |. 


2. We don’t have a nice expression for a, that we can work with analytically, but we can 
work with }7/_; @: it counts the number of solutions 


>> a, =|{(a,b) € Za >0,b>0:04+8' <n}, 
k=1 


i.e., the points inside the circle of radius \/n in the first quadrant. 


This suggests that we use summation by parts on Cx: 


Cx = d ayn * 
7 > (>: a) Geis 
2 > (>: a) [ ee pe ae. 


Now we need to estimate >77_, ax. It is approximately the area of the circle of radius 
/n in the first quadrant, so 


eq:acnf-area S> Qk = “n + En where En = O(n). (35.5) 
k=1 
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(The standard argument is to consider a square for each lattice point; these squares ex- 
actly cover the sector except possibly for a strip near the boundary of area proportional 
to the circumference, ~ \/n.) Putting this in, we have for s > 1, 


oe) 


earacntaiGK(s) = > (Fn + en) [ pea hap (35.6) 


n=1 
writing n = (e«—n) +n, 


0° aa ntl nti 
=> 7 (/ a +f a *!(2 —n) da+ en | —) 

T oo ap OS n+1 7 n+1 i 
= xd += > (f a (t —n) dx +en f oo dx) 


n 


since €, = O(n2). The last sum converges for Rs > 5 to an analytic function, so 


we obtain an analytic continuation for Cx to Rs > s. (We have Fs > 5 rather than 


Rs > 0 for ¢ because of the error term.) Reading off the coefficient of +, we see 


a 


Res,—1 Cx(s) = rai 


Summarizing, the 7 comes from an area calculation, and the 4 comes from the fact 
there are 4 roots of unity in Q(i). 


3. We’d like to write Cx(s) = I], (1 _ a) in terms of primes over p. In a quadratic field, 

a prime p in Z either 

(a) is inert in K, i.e., is a prime of norm p? in K 

(b) splits into 2 primes of norm p, or 

(c) ramifies. 
For a field of discriminant D, these cases happen when (2) = 1, (2) = -—l,andp| D, 
respectively. We get 

1 1 4 1 
¢x(s)= J] ——~ Il (<—] II ( =) 
p inert a p . p splits dis D p ramifies 1— 1G 


1 


7 II L=y-* ele a a ate ¢ ae 
= ¢(s) TI (<>) 


yp \1 — (B) P-* 
(xls) = Cis) Liex), y= ( 


*) aad (35.7) 
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(The L function encodes the splitting law for primes in kK.) Consider K = Q(z). Here 


( - 
1 =1] d4 
x(n) = 4” ‘ ae Now we know Res,(¢x) = 1, so we must have 
—1, n=3 (mod 4). 


T x(n) i ee | 
— L —4 =—=. 1 _ — —-— eee 
g — 2 (sx) d. ns ar aan ol 


Incidentally, note that the identity (35.7) gives (via expanding the Dirichlet series) the 
formula 


| {(a,b) € Z? : a? +B =n} = 4(dy(n) — d3(n)) 
where d;(n) is the number of divisors of n that are =7 (mod 4). 
4. For /—2, what changes in the above calculations? 


(a) We now count instead the solutions to x? + 2y? < n. Alternatively, keeping 
the shape the same, we’re counting the lattice points inside the circle where the 
lattice is generated by 1 and V/2i (each rectangle has area \/2; in general it has 


area 4/ Ix). The number of points is approximately /2zn. 
(b) There are only 2 roots of unity, so we would count points in the whole upper part 
of the circle/ellipse, and divide by 2 instead of 4. 


Similarly for /—3 we get Yan and we divide by 6. We get 


ii 
Res Cqiy3i(s) = F 
2 


hie 


=f 
Lod i] 1 1 
5) 


For /—5, the problem is that the class number is > 1: a prime can now split into 
nonprincipal ideals, so we can’t simply count the number of solutions to x? + 5y? = 1. 
We'll do this rigorously later, but it makes sense that since there are 2 ideal classes, 
we'll only capture half the sum in this way, and we’ll have to multiply by 2. 


In general, if d is a imaginary quadratic field with discriminant D, then 


eq:acnf2 Res,=1 Cx( ) — 
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5. Recall the embedding we use in order to apply geometry to K = Q(V/d) with d positive: 
atyVd4 («+yVd, x —yVd) =: (2',y'). The area of the fundamental parallelogram 
of this lattice L is /|Ax|. 


The norm is different now; it is x’y/ = x? — d*y. Again we have a factor hx. We now 
count lattice points under the hyperbola x’y' < n in the first and second quadrant. 
However, now associates can differ not just by a root of unity (—1), but also by a unit, 
which must be some power e«” of the fundamental unit ¢ (which is sent to the lattice 
point (¢, +)). Drawing a picture, we see that each x +yVd with norm at most n has an 
associate in the region bounded by z’y’ = n and the lines joining the origin and (1, 1), 
and the origin and (e, t), or the rotation of this region by 90°. The area of these slices 


can be calculated to be 2Ine. Thus we expect around TE lattice points. We obtain 


|Ak| 


the following: 


If d is a real quadratic field and ¢ a fundamental unit, then (note wx = 2) 


eq:acnf3 Res,—1 Cx(s) = 


In the case of V5, the discriminant is 5, the character is y(n) = (2), and the funda- 
14+V5 
a 


mental unit is , SO 


1.2 Proof in general 


subsec:acnf 


We now prove Theorem |35.0.1]/in 4 steps. Note that both (35.8) and (35.9) are special 


cases of (35.4): in the first case there is 1 pair of complex embeddings, hence the factor of 
m; in the second there are 2 real embeddings, hence the factor of 2. In general the area of 
a fundamental parallelotope is \/|A,|2~™ (Proposition |16.3.1). The group of units of a real 
quadratic field is 1-dimensional, so the regulator is simply the logarithm of the fundament 
unit. 


1.2.1 Reduce to sum over principal ideals 


We reduce ¢x(s) to a sum over numbers in Ox, not ideals. We separate the sum by ideal 
classes: 
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1 
Jas 


-> 


[]EClxc ac [I] 
where the sum [J] € Clx is over ideal classes. 
Now for each o choose 6; € [I]~! 


=> M/ ah MN(b7)° 


[eClic a€[I] 


fo Dey. Du 


WJEClK (a),a€by may” 


b;)° eq:acnf4 (35.10) 


since a ++ ab; is a bijection from the ideals in |/] to principal ideals that are divisible by 6, 
i.e., principal ideals in the form (a), a € b. If we manage to show each of the inner sums is 
21 (2n)"2 Regie, then we’ll be done. 


ViAxwx 


1.2.2 Reduce to sum over numbers 


The sums are now over principal ideals, but we’d like them to be over elements. The 
obstruction is that many elements will give the same ideal, so we’d like a canonical way of 
choosing a generator of (a). We’d like a region such that every set of associated elements 
has exactly one element in that region, a kind of fundamental domain. 

Any two generators will differ by a product Ceft---e;"°7 where ¢ is a root of unity and 
E1,..-,€p4s—1 are the fundamental units. 

If we have a vector space, and we say 2 elements are equivalent if they differ by an element 
of a lattice, we know what a fundamental domain would look like—a parallelotope. We have 
here a product rather than a Z-linear combination, so we’ll have to embed and then take 
logs. 

Recall the embedding and the log map from Proposition 


[. Ox o=(01,..,0r+s) an, cs £ =(In|-|,...,!n|-|,In 2]-],... In 2]-|) arts. 


(The kernel of this map is the roots of unity.) Recall that L(Ux) is the subspace with 
rj+:::+2%,45 = 0, ie. the subspace with x-(1,...,1) = 0. We choose our canonical element 
a such that the projection of €~'(a) onto this subspace is in the fundamental parallelotope 
made by the @~'(e1),...,€71(€p45-1), i.e. we require 


ea:acnt-latticel (ar) € ([0, 1]2-*(e1) + «++ + [0, No" (e451) HR(1,.--, 1) (35.11) 


=P. 


(For a set S of scalars and a vector v we let Su = {sv:s € S} i). ea the inverse image 
under o and noting the kernel is the roots of unity (Proposition |18.2.2) , we have the following. 


e For any a € Ox, there are exactly wx associates 0 of a such that 


o(G) €RL"(P). 
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Combining with (35.10), 


eq: acnfs¢K(S S> MM (b;)° (35.12) 


s 
WK [eClx aweRE-1(P)nb; Na) 


1.2.3. Reduce to a volume calculation 


Now we show how the sums in (35.10)) reduce to a volume calculation. We claim the following. 


Lemma 35.1.1: temacntvoi Let L be a lattice in R”, and let A(L) denote the volume of its 
fundamental parallelotope.® Let V be a bounded, measurable set containing the origin that 
the points B(OV,c) at a distance of at most c from the boundary satisfy 


Vol[B(OV,¢)| 5,1". 
(There aren’t too many points near the surface.)’ Then 


1. The number of points inside tV satisfies 


Vol(V) 
A(L) 


EVO L| = i™ + O(t") 


2. Suppose [0,1]V = V. Let N(x) = (inf{c > 0: 2 € cV})™ (the norm associated to V, 
made homogeneous of degree m). The function 


f(s) = Dap 


cel 


is meromorphic for Rs > 1 — + with Res,.; f(s) = ALY 


Proof. 1. Take a fundamental parallelotope and place one at each point in tV N L; now 
bound the difference between this area and the area of tV, cf. (35.5). 


2. This is exactly as in (35.6). Noting that the number of z € L with N(x) < n is 


Distinguish this from Ax; the relationship between Ax and A(L) is given by A(L) = \/|Ax|2~"2; see 
Proposition 
"See also VI.§2 in Lang. His Theorem 2 is phrased in terms of (n—1)-Lipschitz parametrizable boundaries. 


498 


Number Theory, 835. 1 


an +e, for some e, = O(n 


fG\= 3 (en + en] - cde 


n=l 


writing n = («© —n) +n, 


oo VM a n+1 n+1 n+1 
- =i ol Cf go + | gz * "(2 —n)dx+ en | aay 


= i) x *dx+ 36 3 ([" a * (2 —n) dx + en i oo ac) 


n=1 
Vol(V) 1 ven Sf on ae er 
= Athy e210 PO. @—n)de+en f au ' de) 
= 


since en, = O(n ™ a), The last sum converges for ts > 1 — 4 to an analytic HanCMOD, 
so we obtain an analytic continuation for Cx to Rs > 1 — — (We have #s > 1 — a 
rather than Sts > 0 for ¢ because of the error term.) Reading off the coefficient. of — 
we see 


+, 


Vol(V) 
A(L) 


Res,-1 f(s) = 


1.2.4 Calculate the volume 


We will apply Lemma}35.1.1] to 
V = (0, 1Jé-'(P)b, 
where P is as in (35.11). Here, the norm is N(x) = |z1---2,,4,,| on [0,0o)V C R™ x CY”, 


We calculate the volume of V: We want to evaluate (note x; € R or C) 


ne / ee Rs Baia = / , ee 27 (2m) Er 41° 1+ Erte, Ak1 ... Ay, 


where dia Lys. 05) SOMOS (iy x ac25 Rigg) Wy atin a yo a) EP (Wigs cag ny SU eta) Sepp aye) 
and we note that the integral only depends on |;|, so 


1. for the integrals over R we can take the integral over x; > 0 and double them (recall 
—1 is in the kernel of L), and 


2. for the ones over C we can switch to polar coordinates (@,r) € R/2m x Ryo, and 
integrate over # to get 27. 


We know the area of P, so we change coordinates to make the integral over P. Change 


fel, 1 <isn ; 
coordinates via the map ¢ (so 7; = 1 = ) which has determinant of 


/ . 
ev? rn <i<ntre 
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. oat ff f / ah 2 
Jacobian 27"2e% --- etretrti/2... e*rtr2/? to get 


fof 2" (Qr)"etrsi/? ... et9/2(Q-M eM... ePritra/? dz, ... dx’,) 
P—[0,00)(1,...41) : 


geeey 


=o ta? / . | ew ..-e*tr2 del... dr!) 
P—[0,00)(1,...51) 
1 


Frat 40 + Arn 
= are | i eyvritr2 dx dy 
P J-c 


1 
= orn f ——. dx 
PVJ/M%1+12 
1 
= 2" 7” Vol(P) ——— 


= 2" a" Regie. 


(Reference for last line) Now each sum in (35.12) has, by Lemma/|35.1.1| residue equal to 


1 — Vol(V) 
Ress=1 >, M(a)s A(o(b;)) 


a€Ré-1(P)nb; 
rg" R 
= aU MEK by Prop. [16.31 
[Ax|2-7 (7) 


> Resja16k(s)=— by [Re x a) 


WK (neClx a€RL-}(P)nb; 


1 gris 
S62 Se 
Ce inecig [Ax |2-"22(b;) 
271 (2m)" Regy 
WK |Ax| 


This finishes the proof. 


2 Special values of L-functions 


sec:special-1 


2.1 Evaluating L(1,.) for quadratic characters 


We were able to find closed forms for the sums (35.1)—(35.3). The same method works in 
general. 

We now evaluate L(1,) for quadratic characters in 2 ways. First, we know that we 
have an algebraic expression for it. If x has conductor D, then letting K be the field with 
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discriminant D, ?? 

L(1, x) = Res.=1 Cx(s) — 
Second, we find an closed form for it analytically. Equating these two will give us a way to 
calculate the class number analytically. 


Theorem 35.2.1 (Formulas for L-functions): Let x be a character modulo m. Let T(y) 
denote the Gauss sum ))j¢(z/m) X(J)X* (J) where x*(j) = ¢’ is the additive character. We 
have 


L(l ) _ ( _ 760 yoke (Z/m)* x(k) In sin ah xX even 
“ {slo Wre(z/m)x X(R)k, x odd. 


In particular, if K is a real quadratic field with discriminant D, then 
k 
Li,x)=-— dX x2) Insin 
m 


(In [?], this is Theorem 3, p. 336 and Theorem 1, p. 344.) 


Proof. bs write L(1,y) as a sum over residues modulo n, and then write the nasty sum 
Vn=m = (partial zeta functions) in terms of nice sums using characters. We have 


=¥ xin) 2 > 


keZ/'m n=k = py 


= vy xh) & us Roky i orthogonality of characters 


keEZ/m jeZ/m ™ 


= a> ayer ys 


kEZ/m jEZ/m 


The inner sum is just the Taylor series of a logarithm. We have® 
=—In(1 —@). 
Letting x7 (n) = ¢™, we find the sum equals (using Proposition |13.2.2) 


L1,x) = -— p> DS x(k) In(1 — ¢’) 


™ eZ ]/m kEZ/m 
1 
=-— DY) Gy xt,) ma - ¢) 
jEZ/m 
'- = 
= Syn -&) 
je(Z/m)* 


8We use the fact that if ba ea, converges, then lim,_,;—- r"e’®" a, equals that sum. 
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where by Theorem |13.2.1]the sum is 0 for 7 4“ m. actually a slight generalization We want 
to express this sum in terms of In of real values. We use a standard trick: match up j and 
—j in the sum. We consider two cases. 


1. x is even, ie., x(j) = x(—j). Then the sum equals 


OES xa) tma—C%) = - 2 ama - 7 
™ 2 se@jm)* ™ * je(B/my* 
a De ain(ct 4) 


where the 2 came out as a In2 and we use Vjez/m X(J) = 0. 


x is odd, ie., x(—7) = —x(y). Then the sum equals 


03 Se Hnaa—) -ma cy) =-7AW EE amen) 
jE (Z/m)* jE(Z/m)* 
ga a 
jE(Z/m)* 
= TW SY xs) 
jE(Z/m)* 


2.2. The class number counts quadratic residues/nonresidues 


Where else does the class number appear? There is no explicit formula for the fundamental 
unit, but remarkably we can find an explicit formula that works for all real quadratic fields 
and gives a power of the fundamental unit. The class number appears as the exponent. 


Theorem 35.2.2: Let K be a real quadratic field with discriminant D and character x. 


Then 
7i= Il (sin ~) ac 


D 
alD,0<a<# 


is a unit and if ¢ > 1 is the fundamental unity, 


. : wh  . Ta 
In particular, Ih(2y=-4 sin > Ia) sin 
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For p= 1 (mod 4), think of the last statement as saying that quadratic residues mod p 
cluster at the beginning of (0, p/2) (where sin ** is small) and the quadratic residues mod p 
cluster at the end. What about p= 3 (mod 4)? 


Proof. Exponentiate both sides of 


1 
h= ae x(x) Insin = 
te tlLDO<ar< 2 
(Ine)h= So Insin —(—x(2)) 
tL DO<«<2 
c= TI sin =) fa 
al DO0<a<2 


In the imaginary quadratic case, we have an explicit formula for the class number that 
depends just on calculating quadratic residues, and not on any anything analytic (In, sin,...). 


Theorem 35.2.3: Let p = 3 (mod 4) and let R,N be the number of quadratic residues 
and nonresidues in (0,5). Then the class number of Q(,/—p)is 


In particular, R > N. 


(p. 346 in BS) 


Proof. We use an averaging argument: to simplify a sum we pair up elements like x(x), x(m— 
x) or x(x), x(} + 2) for cancellation. 
Consider 2 cases. 


1. If |D| is even, it is a multiple of 4. Note 2”~! + 1 is not a quadratic residue modulo 2” 
for r > 2 (why), so } + 1 is not a quadratic residue modulo m, and y(m) = —1. We 
pair up x(x), x(x + 3) to get 


2. If |D| is odd, noting the character of an imaginary quadratic field is odd, x(—1) = —1, 
again we pair up x(x) and y(m — x) to get 


2 


h= x(2)e+ Yo x(a). 


|D| O<a<} O<a< 
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There is a sum of y(x)a, but there’s a trick to get rid of it. We instead flip some of 
the x into m — x, so that we hit exactly the multiples of 2, and then take out a y(2). 


We get 
x)= YO xee+ YO va) 
|D| O0<a<¥ O0<a<¥ 
Subtracting we get 
— x(2) 


Now apply to p=3 (mod 4), Z [=]. 


Note that there does not seem to be an “elementary” proof of R > N for p= 3 (mod 8). 
Seehttp://mathoverflow.net/questions/25707/intuition-for-a-formula-that-—exp 
for a discussion. 


3 Cyclotomic! 


We can calculate for cyclotomic 


/ 
pe I] 24,» 


9m-1,mP 
oe mR X#XO 


for even and odd characters 


oe? : 
EC, x") =—_| >) "In |1 = ¢"" 
VP r=0 
ee Tv = 
(Lx) = SalFO*™) 


and 
Qm— 1 m—-1 |m-1 


; . 1 re 
h= . iu S~ 0" In|1 — Cl Gye! FO) FO) Be 


1 | r=0 


“ 


=—ht h* 


Show directly h* is class number of totally real subfield. 
problem for CFT: Show that if L/K is an extension with no proper unramified extension, 
then hr | hy. 


3.1 Appendix 


Theorem 35.3.1: Let P be the fundamental parallelogram of L(Ux), and let Vol be the 
(r, + rg — 1)-dimensional area. Then 
1 


Vr 72 


Regx = 
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Proof. The regulator is defined by deleting any column of the matrix with rows L(¢<;), i-e., 
it is the area of the projection to a hyperplane x; = 0. 
These projections have the same area. 
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Chapter 36 


Zeta and L-functions in number fields 


inf In this chapter we will define zeta and L-functions in number fields, to obtain density 
theorems for primes in those fields, in particular: 


1. Prime Number Theorem for number fields, and 


2. Chebotarev Density Theorem. 


To define L-functions, we will have to generalize our definition of characters. 

As in the previous two chapters, we need a functional equation and analytic continuation 
of the L-function in order to get good asymptotic estimates. This presents a significant 
challenge. There are two approaches: 


1. (Hecke) Generalize the proof for the L-functions over Q. Namely, use a higher- 
dimensional analogue of theta functions. 


2. (Tate) This is an illustration of the local-to-global principle. First define L-functions 
over local (complete) fields. This is easier because there is only a single prime to work 
with. Then put these L-functions together to get a L-function for the global field. 


Note that L-functions over complete fields are much simpler—provided that you have the 
background in measure theory and functional analysis. We will give the required background 
in Section 

As an illustration, note that the functional equation for ¢ (and similarly L) becomes more 

transparent (€(s) = €(1 — s)) after we define €: 

s s 1 
é(s) =a 40 (=) C(s) =" °F (=) Il ; 
2 2 l—p-s 
eS 


p prime 


The presence of the term in front seems quite mysterious. However, we can think of it as 
coming from the infinite place; so instead of thinking of € as a product over primes we should 
think of it as coming from a product over places. We will define the zeta-function over a 
local field K by 


Cif 8) =f. F(a) lal; da 
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where we choose for f a function that is its own Fourier transform (to get a good transfor- 
mation law). Note that the measure here is the Haar measure on kK. For the case K = Q,, 
f is a characteristic function; by calculating this integral on the sets {a: v,(a) =n},n EZ 


and summing, we get a geometric series which becomes the factor a up to a constant. 
For the real place, we choose f(x) = tere and get 7 2I' (2) out. Magic. 


1 Zeta and L-functions 

2 Class number formulas 

3 Density theorems (weak form) 

4 Analytic continuation: Hecke’s proof 
5 


Measure theory and functional analysis 


measure-theory 


5.1 Measure theory 
For a set E ¥ ¢ define the power set 
PR 20? 21h e re Bh, 
Definition 36.5.1: A subset 6 C P(E) is a o-algebra it satisfies the following properties: 
1. FEB. 
2. B is closed under complementation: [ € B implies [= E\T € B. 
3. {[,:n>1} C B implies UR, € B. 


Note that items 2 and 3 imply that a countable intersection of elements in BG is in B, and 
a difference of sets in B is in B. 


Definition 36.5.2: We call (£, 8) is a measurable space. A measure on (EB) is a map 
pt : B > [0, co] such that 


1. p(d) = 0. 


2. (Countable additivity) If {[,, : n > 1} is a family of pairwise disjoint subsets of E, 


then 
lu (U rs] = > wll), 
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i.e. the volume of the whole is the sum of the volume of the parts. 


Compare this to the definition of a topological space—measurable spaces have measure- 
able sets while topologies have open sets. 


Example 36.5.3: Define a measure pz on the integers Z by associating some ju; > 0 for each 
integer 2, and setting 
wT) = Sy Hi 


ie 
Our strategy is to start with some class of nice, well-defined subsets, and generate more. 


Definition 36.5.4: For a family of subsets C C P(E), define the o-algebra generated by 
C, denoted by o(C), to be the smallest o-algebra containing C. In other words it is the 
intersection of all o-algebras containing C. (This is well-defined since the power set is a 
o-algebra containing C.) 

If E is a topological space and C = {I C E:T open} then o(C) = Bz is called the Borel 
o-algebra. (The sets are called Borel sets.) 


Lebesgue showed that there exists a unique measure on Bg, such that gw (IL) = vol(Z) 
for rectangles [. 

DEFINE integrals given a measure... DEFINE L’... 

The following shows that given one measure, “essentially” all other measures can be 
written in terms of an integral. 


Theorem 36.5.5 (Riesz representation): Suppose that (F£, B,v) is a o-finite measure space 
and p is a finite measure on (EL, B) with p < v. Then there is a unique y € L'(v;R) such 
that 


a) = | ed 
for all T € B. 


Proof. Stroock [add reference], 8.1.2. 


Definition 36.5.6: Let js be a Borel measure on a locally compact Hausdorff space X and 
E be a subset. yw is outer regular on F if w(£) = inf {u(U): U D E,U open} and inner 
regular on F if u(F) = sup{pu(k): kK C E, kK compact}. 

A Radon measure on X is a Borel measure that is finite on compact sets, regular on 
all Borel sets, and inner regular on all open sets. 


5.2 Haar measure 


Definition 36.5.7: Let G be a topological group and yu a Borel measure. p is left trans- 
lation invariant if for all Borel subsets F of G, u(sE) = yw(£). Ditto for right translation 
invariant. 
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Let G be a locally compact topological group. A left (right) Haar measure on G is a 
nonzero Radon measure jz on G that is left (right) translation-invariant. A bi-invariant Haar 
measure is a Haar measure that is both left and right invariant. 


Theorem 36.5.8: Let G be a locally compact group. Then there exists a left/right Haar 
measure, unique up to scalar multiple. 


Proof. |RV99|, Theorem 1.8. 


5.3 Fourier inversion and Pontryagin duality 


Definition 36.5.9: Let G be an abelain topological group. A continuous complex char- 
acter on G is continuous homomorphism G — S', where S! = {z € C: |z| = 1}.' 

Under multiplication, the continuous complex characters form a group G, called the 
Pontryagin dual of G. Give it the compact-open topology, i.e. the topology such that 


W(K,V)= {x eG: x (ie) C Ve K compact, V open 
is a neighborhood base for the trivial character. 


Definition 36.5.10: Let G be a locally compact topological group. A Haar-measurable 
function py : G > C in L™(G) is of positive type if for any f € C.(G) (continuous, 
compact support), 


{f vs tOF(s) as FO at > 0. 


GxG 


Definition 36.5.11: Let f € L'(G). The Fourier transform of f is the function f :G > 
C defined by 


FO) = f FWXW) ay. 


Definition 36.5.12: Define V(G) to be the complex span of continuous functions of positive 
type ib G and V1(G) = V(G)N L(G). 


Theorem 36.5.13 (Fourier inversion): There exists a Haar measure on G such that for all 
fev), 
f(y) = f Fodxtw) ax. 


The Fourier transform f 4 f identifies V!(G) with V1(G). 


Example 36.5.14: The Pontryagin dual of R is R, via the identification y +> e?""*¥. The 
Fourier transform is 


Fw) = [ Fae" ae. 


' Alternatively, G > R/Z, thought of additively. 
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The Fourier inversion formula reads 


f(c) = CONSTANT [ Fly)ei™ dx 


The Pontryagin dual of R/Z is Z, via the identification e?""*. The Fourier transform is 


fly) = ff (we de 


and the Fourier inversion formula reads 


f(y) =CONSTANT >~ f(ye’™™". 


neZ 


The Pontryagin dual of an abelain group G can be identified (noncanonically) with G 
itself. Fourier inversion formula gives character formula! Connect with stuff in chapter on 
characters. 


Theorem 36.5.15 (Pontryagin duality): The map a: G— G defined by 


a(y)(x) = x(y) 


is an isomorphism of topological groups. Hence G' and G are mutually dual. 
Measure on local fields. Relate to metric. Ostrowski’s theorem again. 


Theorem 36.5.16: tim:rest-proa Suppose 


/ 


G= I(@. H,) 


e 


is a restricted direct product of locally compact abelian groups G, with respect to open 
subgroups H,. Then 


/ 
1. 


G 


IIe 


APPLY TO IDELES/ADELES! 


6 Analytic continuation: Tate’s thesis 


The main steps of the proof are as follows. 


1. Define an additive and multiplicative measure on local fields, and classify all characters 
on these fields. We divide into three cases: real, complex, and p-adic. 


2. Define local L-functions and prove a functional equation for them. This functional 
equation comes directly from the Fourier inversion formula applied to the local fields. 
Compute the functional equation in each of the three cases. 
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3. Show that the adele ring—a restricted direct product of local fields—behaves nicely as 
a product. That is, the following hold. 


(a) The measure is the product of local measures. 
(b) Products of nice (continuous, Z') functions on the K, give nice functions on K. 


(c) The Fourier transform of a product is the product of the Fourier transforms. 
Moreover, the adele is self-dual, because it is a restricted product of self-dual spaces. 


4. Establish the Poisson formula and Riemann-Roch Theorem. Embed K into Ax and 
think of K as a “lattice” in Ax to apply the Riemann-Roch Theorem. The local 
functional equations plus the Riemann-Roch Theorem give the analytic continuation 
and functional equation for the global L-function. This formula gives a relationship 
between a character and its dual, but we know that Ax is self-dual. 


5. Specialize to the case of Hecke characters to obtain the classical functional equation. 


We now carry out this program. 


6.1 Haar measure on local fields 


6.2 Local functional equation 


Definition 36.6.1: Let f be a NICE function. Define the local L-function of f to be the 
function on quasi-characters with positive exponent given by 


Traditionally, we think of L functions as functions of a complex variable. We recover this 
viewpoint if we write c in the form 
C(x) = eo(2x)|a|* = co(x)|a|7™, 


where co(x) is a character in the same equivalence class as c(x). Then fixing co, we can think 
of L(f,c) as a function in s: 


L(f, co, 8) = L(f, co] - |’). 
Lemma 36.6.2: For any f,g NICE and any quasi-character c with exponent in (0,1), 
L(f,)LG,2) = Lf, QL(g,). 
Here é(x) = |z|e(x)71. 


In other words, where it is defined Ane is a function determined only by c. Thus we get 


(f.¢) 
the following. 
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Theorem 36.6.3 (Local functional equation for DL): tnm:ocatteq A local Z-function has ana- 
lytic continuation to the domain of all quasi-characters given by a functional equation 


L(f,¢) = p(OL(f,®, 
where p(c) is a function independent of f. 


We now calculate the functional equations for K real, complex, and p-adic. To calculate 
p(c), it suffices to choose a nice f and compute 


since this function is independent of f. The results are summarized in the following table. 


Theorem 36.6.4: The quasi-characters for K are given in the top row of the table. Defining 
the corresponding functions f as in the second row, th Fourier transforms of those functions 
a are those given in the third row, the ¢-functions are given in the fourth row, and the 
functions p(c) are given in the fifth row. 


R C K, 
ce lee ela)ial* Cn(@) character 
sign(x)|2|° where c,(re’’) = e’” of conductor f = p” 
fl fs)=e7 fn(8) = fan = PONT oop) -1 
jis) =se™ (In| e—2nlsl? n>0 
| sile-2eb 20 
f | fw) =f) Faly) =i"! f_a(y) fn = (MO) 2NFlis; 
fay) = tf) 
L L(f, | . 2 = aa) L(Fasenl - |?) = Ll Fray Ci olf) = 
L(fa,|- |) =F T (24) | 2m 2 (st) | ga-smaeg (cue?™(satbon) 
Lf P)= FT (5) | DG eal -P) = ee 
L(fz, £|-|*) = arin) (1 — + ll) | Pmvenl 1) = MOL +A 
in (258 
p | el P)= P(Cn|- °) = p(| - [°) = Sto 2 
2) 8a * cos (=) I(s) (—i)" (2m) sa — (| - |) = 
atl |) = @nF(G-a)+ eat ee 
—i2!-®7-* sin (=) I(s) MOA)” ANF 2G (ce ( en) 


7 Density theorems (strong form) 
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Part VI 


Automorphic Forms 
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Chapter 37 


Theta and elliptic functions 


1 Theta functions 


Definition 37.1.1: A theta function of degree n on [w1,w2] with parameter b ¥ 0 is an 
entire function f(z) such that 
Qrinzg 


fle+wr) = fz), Fle +w2) = be f(z). 


We aim to classify all such functions. For simplicity assume w, = 1 and w2 = 7, with 
Sr > 0. (Rescale.) 


Proposition 37.1.2: The space of theta functions of degree n and parameter b forms a 
n-dimensional space. They are in the form 


[o-e) 
S- ang 
k=0 


where q = €7", ag,...,@n—1 can be freely chosen, and the coefficients satisfy the recursive 
relation ss 
_ mp+7Pe—— —2Qnit 
Am+pn = b Pay Gm, de . 


DARN there are so many different definitions of the theta function. In particular, the 
following is a theta function of degree 1 and parameter b: 


(oe) 


6(2) = (-1)¥g > 2 = C(qo) TL (1 — aad — ag"). 


keZ n=0 


We have the following analogue of the fundamental theorem of algebra. 


Theorem 37.1.3: Any theta function of degree n is in the form 
f(z) = K@(z — 21)---O(z — 2n)q” 


for some 21,...,2n € Candr € Z. 
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1.1 Transformation law 
2 Elliptic functions 


Definition 37.2.1: An elliptic function on the lattice A is a meromorphic function f(z) 
on C such that 
f(iz¢t+w)=f(z) forallweA,z€C. 


Denote the space of all such functions by C(A). 
There are nice relationships involving the zeroes and poles of elliptic functions. 


Theorem 37.2.2: Let f be an elliptic function on A. 
1. wec/A Res,,(f) = 0). 


2. Lwecsa Ordy(f) = 0, ie. in a fundamental parallelogram there are as many zeros as 
poles, counting multiplicities. 


3. Lwecsa rdw (f)w € A. 
Proof. if 
2. 
3. Label the edges of the fundamental parallelogram as follows. 


C2 


We calculate {sp =f! dz in two ways. 


Way 1: 
O._[f #@Qe.¢ QO). fQe.f #Oe 
he tay = [den geey &t bes a] +(f, iO. er FO) a]. 


Noting that C3 is just C, shifted by w, and reversed, and that C2 is just Cy shifted by 
Ww and reversed, this equals 


[,fde= [FOE anf 


Since f is elliptic, f(z) = f(z +41) = f(z +2), giving 


f(z) fle +1) 
“#'(2) _ [| [Og 4 f fa 
hs f(z) ileal fe “ > Fey @ 
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Now In(f(z)) can be defined in a neighborhood around C; and C4, since f has no poles 
or zeros on OP. Since f(a) = f(at+uw1) = f(a+we), we have In(f(a+w1))—In(f(a@)) = 
27ic; and In(f(a)) —In(f(a+we)) = 27ic. for some integers c; and cy. But these equal 
the above integrals by definition of In f(z), so 


2F @) = —271(W Cc, + Wee 
is f(2) dz => 2 ( {C1 + 2 2). (a1) 


Way 2: Note Res, ne = ord, f so Res, se = aord, f. Letting a, be the poles and 
zeros of f in P, we get by Cauchy’s Theorem that 


zf'(z) f(z) 
= 2ni) Res, —~ = 271) mya. 37.2 
Jp Flay = 2 LR Fey = PML ma oa) 
Equating (37.1) and (37.2) give 
JS" Mpa, = —W1C, — W2C2 =O (mod A). 
k 


Definition 37.2.3: The order of an elliptic function is the number of poles in a fundamental 
parallelogram. 


It turns out that elliptic functions can be expressed as quotients of theta functions. 


Theorem 37.2.4: 


3. Weierstrass o-function 
Our basic example of an elliptic function is the following. 


Definition 37.3.1: Define the Weierstrass o-function for the lattice A by 


224. S 


2” NE A\{O} 


Proposition 37.3.2: The series defining g converges absolutely and locally uniformly on 
C — {A}. is an even elliptic function with period A, analytic except for a double pole at 
each point of A, 


In fact, we will see that it is the building block for all elliptic functions. 
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Proof. 


Theorem 37.3.3: Every even elliptic function can be written as a polynomial in g. Every 
elliptic function can be written as a polynomial in ¢ and ¢’. 


Theorem 37.3.4: 
O(z+a)0(z—-a)  6'(0)? 


2) = Ty Haye =a) 


Theorem 37.3.5 (Weierstrass differential equation): 
p'(z)” = 4(@(z) — e1)(@(z) — €2)(@() — es) = 40(z)” — 60G4 @(z) — 140G6(z) 
KH’ a 
92 93 


This says that for every z, the point (g(z), @'(z)) lies on the elliptic curve y? = 4x? — 
60G4—140G.. Together with surjectivity and the Uniformization Theorem|37.3.6|this implies 
that all elliptic curves can be parameterized in this way. (NONZERO DISC.) 


Theorem 37.3.6 (Unifomization theorem): unitormization Let A,B € C satisfy A? — 27B? 4 0. 
Then there exists a unique lattice A C C such that go(A) = A and g3(A) = B. 


3.1 g and lattices 


Theorem 37.3.7: Let L be the lattice corresponding to o(z). For a € C\Z, the following 
are equivalent. 


1. (az) is rational function in ¢(z). 


2,.aLCL. 


3. There is an order @ in an imaginary quadratic field K such that a € @ and L is 
homothetic to a proper @-ideal. 


Then 
plaz) = Br 
for relatively prime polynomials A and B such that 


deg(A) = deg(B + 1) = [L: aL] = Na. 


Proof. (1) = > (2): Suppose that g(az) = ey with A and B relatively prime. Then 


B(9(z)) (az) = A(p(z)). (37.3) 


For any w € L, o(w) has a pole of order 2, and each linear factor o(z) +r of A(g(z)) and 
B((z)) has a pole of order 2. In particular, for w = 0, we get that the order is 


2 deg(B) + 2 = 2deg(A) 
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showing that deg(A) = deg(B) + 1. Now take any w € L. Counting the order of w on both 
sides, we find that g(az) has a pole of order 2 at w. Thus aw € L. This shows aL C L. 
(2) => (1): For any w € L, since aL C Lwe have 


p(a(z + w)) = paz + aw) = e(a2). 


Hence ¢(z) is elliptic with L as a lattice of periods. Since it is even, by (?) it is a rational 
function in ¢. 

(2) = > (3): By a homothety we may suppose L = (1,7). Since L has rank 2 as a 
Z-module, T must be of degree 2 over Q. Now take 


6 ={B € Q(t): BLCL}, 


i.e. the “codifferent.” 
(3) = > (2): Easy. 


Now, supposing (1) is true, rearrange g(az) = ean to get 
A(z) = g(az) B(x) =0. (37.4) 


Fix z so that 2z ¢ +L and such that A(x) — 9(az)B(2) has distinct zeros. (Claim: Given 
polynomials A, B, there are only a finite nmber of values of c so that A — cB has multiple 
roots.) Let {w;} be a set of coset representatives for L in +L. We claim that the roots 


of (37.4) are exactly z+ uj. 


We have 
A(p(z + wi)) — paz) B(plz + wi)) = A(olz + wi)) — p(a(z + wi))B(olz + w;)) = 0 


by blah, so o(z + w;) are roots of (37.4). 

Now if o(z+w;) = e(z¢+w,;) then by BLAH, (z+w;) = +(z+w,;) (mod L), giving either 
22 =U; — Wye 1L and 2z € i, or w; = w; (mod L). The first is impossible by assumption 
on z, so 2 = 7. This shows the roots are distinct. 

Finally, given any root of (37.4), by surjectivity of o we can write it in the form g(y). 
We have 


play) = Ba) = (az), 


where the first equality is by definition of A and B and the second is because ¢(y) is a root 
of (37.4). Then by BLAH, ay + az =0 (mod L). Since g is even, we may replace y by —y 
as necessary, to get a(y — z) = 0 (mod 4L). Thus y € 2+ +L and g(y) = 9(z + w;) for 
some 7, as needed. 


Since (37.4) has [LZ : +L] = [aL : L} roots, (37.4) and hence A has degree [aL : L]. 


Note the equivalence (2) <= (83) (which incidentally has nothing to do with elliptic 
functions) gives that a lattice is a proper fractional ideal of @ iff it has @ as its ring of complex 
multiplication. Nonzero fractional ideals are homothetic iff they determine the same element 
in the ideal class group. Hence there is a correspondence between IDEAL CLASS GRP and 
homothety classes of lattices with @ as full ring of complex multiplication. 
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Chapter 38 


Modular forms on SL9(Z) 


1 SL2(Z) and congruence subgroups 


Definition 38.1.1: SL2(Z) is the group of 2 x 2 integer matrices with determinant 1. 


d 


SL(Z) := (e 1) :a,b,¢c,d € Z, ad—be=1}. 


Define PSL2(Z) = SL2(Z)/{+1}. Define the following subgroups: 


T(N) = {M € §L,(Z):M = ¢ 
T,(N) = {M € SL:(Z):M = ¢ 


To(N) = {M € SL.(Z): M= i 


Any subgroup of SL2(Z) containing I'(V) for some N is called a congruence subgroup. 


; (mod ny} 
) (mod ny} 


‘) (mod yy} 


Definition 38.1.2: SL2(Z) acts on the upper half plane H by 


az+b 


(: Ve 
ed) “expand. 


We now collect some facts about SL2(Z) and its congruence subgroups. 


Proposition 38.1.3: The matrices S = (°, §) and T = (4+) generate SL2(Z). 
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1.1 Cosets 
Proposition 38.1.4: We have the following: 


1 
[SL2(Z) : To(N)] = Nit (1 + - | 
1 
[Po(N) : Ti(N)] = Nu (1 — = | 
[Pi(N):T(N)] = N 
Moreover, 
1. Set of coset reps for [o(V) in SL2(Z)? 
2. Let S = {(a,b) € (Z/NZ)? : gcd(a,b) = 1}. For each 
_ S=1(0,0)} 
(z,t)€ P:= TZND* 


take an integer matrix of the form (7%). These matrices form a set of right coset 
representatives for [9(.V) in SL2(Z). 


Proof. 1. Let G be the group 
{(a,y)|a € (Z/NZ)*,y € Z/NZ}/{£(1, 0)} 


with the operation 

(a, y)(a',y') = (aa’, ay! +a""y). 
The fact that G is a group can be shown directly, or by noting that the group structure 
on G is the “pushforward” of the group structure on ['9(V) by a below. We claim that 


13T(N) 31)(N) >G>1 


is a short exact sequence, where 


cee 2)) = (a,b) mod N. 


We verify: 


(a) m is surjective: Given (a,b) € G, we can choose b so that a = a@ (mod N),b=b 
(mod NV) so that ged(a, b) = 1. Let d be an integer such that ad = 1 (mod NV). By 
Bézout’s Theorem we can find k, 1 so that ak—Ib = 444. Then a(d+kN)—Nlb = 
1, and the following matrix is in SL2(Z). 


ir (@ uae) Fulah): 
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(b) ker(7) = T(N): The inclusion [(N) C ker(z) is clear. Conversely, if A = 
ce 


moreover ad — (Nc)d = 1 and a =1 (mod N) imply b= 1 (mod N). 


First suppose N # 2. Then |G| = $y(N)N, so 


7, [PSL(Z):TO)) Tew 1-2) _ 1 
[PSL2(Z) : To(N)] = a ee ol (1 -: =| 


BR 


For N = 2, [PSL2(Z), P(.N)| = 6 and |G| = 2, so [PSL2(Z) : To(.NV)] = 3 (and the above 


formula works as well 


1.2 Useful decompositions 


Bruhat 


1.3. Fundamental domains 


Definition 38.1.5: Let H be a subgroup of SL2(Z). A fundamental domain for H is a 
subset of H such that the following hold. 


1. 


2 Modular forms 


Definition 38.2.1: A modular function on SL2(Z) is a function f : H — C such that 
1. f is meromorphic on H. 


2. f satisfies the following transformation property. 
f me z} = (cz +d)* f(z) for all ane € SL, 
ed c d : 


If moreover f is holomorphic on H we say f is a weakly holomorphic modular form, and 
if f is holomorphic on H* = HU{oo}, we say that f isa modular form. (f is “holomorphic 
at oo” if f has a Fourier expansion with nonnegative exponents 


f(z) == ss ang”, q= ene) 


n>0 
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We say f is a cusp form is ag = 0 above. We denote 


Mj, = weakly holomorphic modular forms of weight k 
M;, = modular forms of weight k 


S; = cusp forms of weight k. 
Note we will generalize this definition several times (add references when I put them in) 
Theorem 38.2.2 (Weight formula): Let f be a modular form of weight k. Then 
k = 6ord,(f) + 40rd,,(f) + 120rdioo(f) +12 S© ord,(f). 


ze Rp 


Proof. Don’t feel like writing... will be vastly generalized using Riemann-Roch anyway. 


3 Ejsenstein series 
The following will be our most important source of modular forms. 


Definition 38.3.1: Let k > 4 be even. Define the Eisenstein series of weight k as a 
function on lattices to be 


1 
weEA\{O} 
Define the Eisenstein series as a function on H to be 
1 
Gil 2): = Gel (2) = Gate 


(a,b) €Z?\ {0} 


Define the normalized Eisenstein series as Ey =?Gp. 
Note that if k is odd, G; as defined above will be 0. 
Proposition 38.3.2: G, is absolutely convergent, and is a modular form of weight k. 


Theorem 38.3.3: The Fourier expansion of FE; is 


ge 


E,(z) =1- B, 2, %4-1(n)q" 


eq = 1+ Yi at": 


where B; is the kth Bernoulli number: = 
Definition 38.3.4: Define 
sak 
1728 


as a function either on lattices or on H. 
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A is a cusp form of weight 12, normalized so its first term is z. As we will see, it spans 
the space of cusp forms of weight 12. 
The functions G4, Gg parameterize elliptic curves over C. (See...) The following will be 
important in establishing a connection between elliptic curves and lattices. 
Theorem 38.3.5 (Uniformization theorem): The map I + C?\{A = 0} defined by 
Dt (Ga, Gs) 


is surjective (bijection’?). 


4 The spaces M, 


Theorem 38.4.1: The set 
k 
{ Bx-toed" :0 < r < EA Be — 12r f- 2| 


is a basis for M;,. Thus 


dim(M,) = | 


5 Dedekind eta function 


Theorem 38.5.1 (Transformation properties of 7): The function n(r) = q3 TI%,(1 — q”) 
satisfies 


n(t + 1) =e? nr) 
—l T 
n(=) = [7am 
T i 
There are two main ingredients to the proof. 


1. Derive transformation properties for twisted theta functions 6, using the Poisson sum- 
mation formula. 


2. Write 7 in terms of theta functions using the Pentagonal Number Theorem ??. 


Proof. For the first part, note 


I 
tov) 
an 
ic) 
18 
—— 
be 
| 
tay) 
i) 
y 
— 
3 
rs 
3 
—— 
4 
7 


sie 4s 1) =e Mr Ul (1 = eur) 


n=1 n=1 
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For the second part, recall the transformation formula for the theta function (Proposi- 


tion B4]34.2.4) 


2rie 


6,(r) = see dg. (=) (38.1) 


where y is a primitive multiplicative character modulo r. 
By the Pentagonal Number Theorem, 


I 
—— 
| 
re 
iQ 
wo 
cy 
“yw 
tol 
Bla 
nN 
3 
pre 
a 


3 
a) 
N 


I 
—— 
| 
pay 
er 
3 
° 
a 
a 
3 
+ 
mH 
mS 
— 
i) 
rN 
a 


N 


3 
Nl A 


(scapes) eapestonnGa 


—T 
« (Gr) oP) 
where x(7) is the character modulo 12 taking values 1, —1, —1,1 at 1,5,7, 11, respectively. 
First note G(y,e*") =es —es —et tee 


D 


6 = 2/3. Hence 
1 
1(-7) = % (aoe) by @Q 
Ghe= 12 


 12,/i/(12r) 
—1T 127 
= 372. (a) 


= V~irn(r). by 


6 Derivatives of modular forms 


Let f be a modular form of weight k. Is f’ (derivative with respect to 7) a modular form? 
Differentiating the transformation law gives 


f(SE8) = ter eater) 


(22 


a 7) (er + d)~-? = k(cer +. d)*""cf (r) + (er +. d)* f'(r) 
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,{aT+ b _ n 
bi (2 53 ;) = her es aac +(cr + d)*** f'(r). (38.3) 


Unfortunately, f’ isn’t quite modular. So we need to construct a modified notion of derivative 
(which we’ll call @) that takes M; to My42. To do this, we will use the derivative and the P 
function, defined below in terms of the 7 function. 


Definition 38.6.1: Define 


24 9! 
pejya es 1). 
2ni (7) 
Theorem 38.6.2: 
1 P= Eo, i.e. 
_ 4 Sou 
~ By n=1 
we 
24 
2. P satisfies the transformation law 
12c 
P(qr) = (er + d)?P(r) + ail ctT+d) . (38.4) 


“nonmodular” part 
Proof. For item 1, note that = = niga, by the chain rule so 
ro 


d oa d 
eas — ) pases — gr ae 24 
= Ins) = 2rig (s s . In(1 — gq”) + iG ng 


wa 720 (Sarma tal 


-m( S- qe +a) 


n=0 m>0,n|m 


-mi( So i(m)q”™ +z): 


m>1 


For item 2, note (S, 7’) = GL2(Z), so y can be written as a product of S = G a a oe 


(j i) hy ia (j i): The base case is trivial. For the induction step, first differentiate 


the transformation laws for 7 to get 
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24 


ee transforms under 7. The induction step comes from 


Using this we can calculate how 
checking that if y = (: ) then 


Poop“ teh 


2710 
P(T*"yr) = P(qr). 


Now we are ready to define our differential operator. 


Definition 38.6.3: For f a weight & modular form, define 
On(f) = (120 — kP) f 


where 
d 1d 
9 = q— = ———. 

dq  2nidr 
Theorem 38.6.4: 

1. Oj, is amap from M; to Mp4. 

2. O is a derivation, i.e. for f € Mn,g € Mn, we have 

Oman d 9) = (Out )g 5 f (Ong): 


3. The following hold (P = £y,Q = Fy, R = Eg): 


on,P = —Q oP = —(P?—Q) 

0,Q = —4R 6@ = 5(PQ-R) 

O.R = —6Q? OR = 5(PR-Q?), 
Proof. For part 1, calculate (Of)(AT) using (38.3) and (38.4). 


For part 2, 


Omen F9) = 5--(F9)'—(mtn)P fg = = —f'9-m(PH)gt+5—fa'—nf(Pa) = (Omf)9+F (Ong): 


For part 3, more calculations show that 0:P + P? is a modular form. The equalities 
follow from using dim(M,) = dim(Mg) = dim(Mg) = 1 and matching constant terms of the 
q-series. 


Remark 38.6.5: Since Q, R generate the space of modular forms, this completely describes 
the action of 0 on modular forms. The fact that it is a derivation means that we can calculate 
its action on a polynomial in P,Q, RF as if it were actually a derivative, taking note what 
02P, 04Q, OgR are. This is since for polynomials, stuff like the chain rule can be derived from 
the product rule, which we have. 
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7 The j-function 


Definition 38.7.1: Define the j-function (on lattices or H.) by 


_ EF 
=a 


Since E} and A are modular forms of weight 12, 7 is a modular function of weight 0. 
The function 7 has some very nice properties. 


Theorem 38.7.2: j takes on every value in C exactly once in its fundamental domain. 
in/excluding boundaries the right way 


Theorem 38.7.3: A function on H is a modular function of weight 0 if and only if it is a 
rational function of 7. 


7.1 The modular polynomial ©,, 


Definition 38.7.4: Define ®,,(X,Y) so that ®,,(j,Y) is the minimal polynomial of j(Nz) 
over C(j). 


Note this is well-defined because C(j) = C(X). 

This will be important when we define the moduli space of an elliptic curve, because 
(j(z),7(Nz)) will map the moduli space to an algebraic curve whose associated function 
field is C(j(z), j(N2)). 

Proposition 38.7.5: The following are true. 

1. ®,,(X,Y) € Z. 

2. ®,,(X,Y) is symmetric for m > 1. 


3. (Kronecker’s congruence) If p is prime, then 


&,(X,Y) =(X?— X)(Y?—Y) (mod p). 


4. If m is squarefree then ®,,(X, X) has leading coefficient +1. 
Proof. ie 
2. F(X,Y) = F(Y,X): Replacing z with —3; in 


F(j(z),j(N2)) = 0 
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gives 


r(-t)u(2) <0 


Note that j is invariant under y = (°,§) € SL2(Z) which sends z to —$. 


j (-x) = FUN Z) 4 (—2) = j(z), and we get 


F(G(N2z), j(2)) = 0. 


Hence 


Since F(X, Y) is irreducible in C[X,Y], so is F(Y,X). Then F(Y,7) is also the irre- 
ducible polynomial of Y over C(j), so replacing j with X, this says that F'(Y, X)|F'(X, Y). 
The only way for this to happen is if F(X,Y) = cF(Y,X). We have F(X,Y) = 
cF(Y,X) =F(X,Y), soc=+1. If c= —1, then F(X, Y) = —F(Y, X), and putting 
X =Y gives F(X,X) = 0. This shows X — Y|F(X,Y), which is impossible since 
F'(X,Y) is irreducible with degree [[(1) : To(NV)|] > 1. Thus F(X,Y) = F(Y, X). 


Lemma 38.7.6: Let 71,...,7p41 be coset representatives for [['(1) : To(p)]. Then 


Cilon2),---derH2)} = GP} Uj (=) 0<k<ph. 


Proof. There are indeed p+ 1 coset representatives because pt = N [I prime gl (1 “ ) = 


p+1in this case. Given y = (75), we have pyz = ( pb) z. For any 7’ € I'(1), we have 


j(y'pyz) = j(pyz) since 7 is invariant under ['(1). By Lemma 6.3.1 we can multiply 
(= pb) on the left by some matrix in ['(1) to get some GA se with a’d' = det ( p>) =p 
and 0 < vb! < d’. The p+ 1 possible matrices are (2°) and co for0 <k <p. We 
claim that all these are in fact attained. Let M be one of these matrices. Then by the 
Elementary Divisors Theorem there exist A,B € I(1) such that AWB = (3 ae But 
then M = A7!NB, so j(Mz) = j(A7'NBz), and we could have picked B as a coset 
representative (the choice doesn’t matter anyways). The lemma follows upon noting 


that (2) 2 =p and (ae ee a 


Let ¢, be a pth root of unity. We have that 1 — ¢,|p: indeed 


pon Peele = LG) eC”). 


z+k 
Pp 
j(z). However, roots are unity are congruent to 1 modulo p, since C -1=(q- 


When we expand 7 ( : its coefficients are roots of unity times the coefficients of 
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1)(¢h-1 +---+1). Then 


F((2),Y) = Te — §(4s02)) 
- w -iea) TE (¥-1()) 


the last equation following because raising the 7 function to the pth power is the 
same, modulo p, as raising each term to the pth power, and the coefficients (which are 
integers) are not affected modulo p, while the exponents are multiplied by p. Replacing 
j(z) by X we get 


POY Sa SPP a a ere yer Reve XY * med oC): 
Pp 


However, (1 — ¢,) 1Z = (p) (it contains (p), and (p) is maximal in Z), and we know 
F'(X,Y) has integer coefficients, so congruence holds modulo p. 


8 j and Hilbert class fields 


Our main theorem in this section (Theorem ??) is that values of the j-function at quadratic 
integers (or equivalently quadratic ideals) generate Hilbert class fields of quadratic exten- 
sions. To prove this we first need a result on 7 in terms of lattices. 


Definition 38.8.1: A cyclic sublattice L’ C L is a lattice such that L/L’ is a cyclic group. 


Theorem 38.8.2 (Correspondence between roots of ® and cyclic sublattices): Let m € N. 
The following are equivalent. 


1 OO: 


2. There is a lattice L with cyclic sublattice L’ C L of index m such that u = j(L’) and 
v= j(L). 


We first characterize cyclic sublattices. 
Lemma 38.8.3: The cyclic lattices of (1,7) are exactly those given by 


eae, é ) Zh), (38.5) 
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where 


Clim) = (5 1) :ad=m,a>0,0<b< d,gcd(a, b, d) =i, 


Moreover, these give rise to distinct lattices. 


Proof. Suppose L’ = (d,at + 6). Then the presentation of the Z-module L/L’ is given by 

d, 0 
0 a) SL2(Z) for 
some dj | dz and that L/L’ = Z/d,\Z x Z/d2Z. Note that multiplying by a matrix in SL2(Z) 
preserves the gcd of the entries. Hence we find that d, = gcd(a,b,d). Hence 


(45). By the structure theorem for modules, we have (45) € SL2(Z) ( 


L' is cyclic <=> ged(a,b,d) = 1. (38.6) 


This shows that all lattices in the form (38.5) are cyclic. 

Now given a cyclic sublattice L’, let d € N be the smallest integer in L’, and a + br be 
such that L' = (d,at + b). We may change b by a multiple of d so that 0 < b < d. Since 
m=|L21/|)=|¢%|=ad and ged(@,6,d) = 1 by (38.6), (25) € C(m). 

Uniqueness follows since d is the least positive integer in L’ = (d,at + 6), and once d is 


determined, a = 7 and b are determined. 


Proof of Theorem[88.8.4 By Lemma |38.8.3} when L’ = {d,a + br], letting o = (2°), we 


have 
H(L’) = j(d{1, o7]) = 3([1, 7). 


Then 
®n(X,j(7))= J] (X -5(e7)) = II (X=gL)). 


a€C(m) L’ cyclic in L, [L:L’)=m 


Hence any pair (j(L),7(L’)) is a solution; conversely, given a solution (X,Y), we have 
Y = j(L) for some L, and the above gives X = j(L’). 


Theorem 38.8.4: Let @ be an order in an imaginary quadratic field and aa @-ideal. Then 
j(a) is an algebraic integer and K(j(a)) is the ring class field of @. 


Proof. Let M = K(j(a)) and L be the ring class field of @. 
Step 1: Suppose aa is a cyclic sublattice of a; let m= N(a). We have 


Pm(I(4),5(4)) = Om(y(oa), j(a)) = 0, (38.7) 


where the first equality is by Theorem |38.8.2}and the second is because a and aa are similar 
lattices. Hence j(a) is a root of ®,,(X, X). 
Pick a so that Na is squarefree. To do this we note that by Theorem ??.?? 


Spl(L/Q) = {p prime : p= N(a) for some a € G}. (38.8) 


Choosing such a, we have [a : aa] = N(a) = p, so aa must be cyclic. Then the leading 
coefficient of ®,,(X,X) is +1 by Proposition (38.7.5), so j(a) is an algebraic integer. 
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Step 2: We show M = L by examining how primes split in L and M, i.e. we show 
Spl(M/K) = Spl(L/K) and use Theorem ??|29.3.9| First we show Spl(M/K) ® Spl(L/K). 
Take p C Spl(Z/Q). The idea is to use Kronecker’s congruence: We know that we have 


a’? =a (mod p) for everyacF <— F=F,. (38.9) 


When we have X, Y equal to values of j in a field extension M/K and ®,(X,Y) = 0, then 
this congruence gives us information about the residue field of M. We will find that it equals 
F,, so M/k is unramified, giving that p splits completely in L. 

By (88.8), for all but finitely many p € Spl(L/Q), p = N(qa) for some a € @. As in (38.7), 
we get 0 = ®,(j(a),j(a)). By Kronecker’s congruence, 0 = —(j(a)? — j(a))? (mod p), so 


j(a)? = j(a) (mod p); (38.10) 


a fortiori this holds modulo Sf. 

Next note @x[j(a)] has finite index in @y, because the fact that M = K(j(a)) gives it 
is a full lattice in @y (considering them as Z-modules). 

Now assume p { [@xz : @x[j(a)|]; we will show that implies the congruence 


a’ =a _ (mod $) (38.11) 


fora € Gy. First, take p = $B K, and note that p € Spl(//Q) implies that the residue 
degree of 8 is p, and hence a? = a (mod p) and a fortiori modulo $B for a € Cx. So (38.11) 
holds for a € @|j(a)]. Now for arbitrary a € Gy, letting N =[@y : Cx|j(a)]] we know 


(Na)? =Na_ (mod $); 
in particular, N? = N (mod $f); 


But p{ N means N is invertible in m := @),/8, so dividing these two equations gives the 
desired answer. 


Now by (38.9), (38.11) gives that |m| =p, ie. f(8/p) =1 and p € Spl(M/Q). 


From this step we obtain M C L. 


Step 3: Next we show Spl(M/Q) © Spl(L/Q). Take p € Spl(M/Q); assume p unramified in 
M and relatively prime to 
A= [I (ia) —3(6)). 


{a,b}ECK 


(Note this is in @;, by step 2.) Using the description of Spl(L/Q) given in step 1, it suffices 
to show p = N(a) for some a. 

We have f($8/p) = 1 for some $8 in M above p. Let 9’ lie above Bin L. Let p = 
BO Ox; we see f(p/p) = 1 so (p) splits as pp in K and Np = p. Hence pa is cyclic in a. 
Theorem (38.8.2) and Kronecker’s congruence give 


0 = (i (pa), j(a)) = (9(a) — j(pa)”)(G(pa)? — j(a)) (mod p); 
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this holds modulo 58’ as well. Hence we have 


j(a) = j(pa)? (mod P’) or (pa)? = f(a) (mod $f"). 


By assumption, f(8/p) = 1, so 6; /$B = F, and j(a)? = j(a) (mod $’). Together with the 
above we find that’ 


j(pa) = j(a) (mod $f’). 

If a, pa are in distinct ideal classes, then 38’ | j(pa) — j(a) | A, contradicting the fact that p 

and A are relatively prime. Thus they are in the same ideal class, and p = (q@) is a principal 

ideal. This means p = Na is in [88.8] as needed. 
Combining steps 2 and 3 gives L = M. 


9 Hecke operators 
Hecke operators give a map on modular forms. We first define their action on lattices. 


Definition 38.9.1: Let £ denote the set of full lattices in C, and K = Z®* denote the free 
abelian group generated by the elements of £. Define the Hecke operator on K by setting 


T(n)[A] = x [A 
NEL, [A:A/]=n 


and extending linearly. 


The sum is finite because any A’ in the sum must contain nA, and A/nA is finite. We 
may think of modular forms as functions on lattices f(z) = F((1,7)), hence T(n) induces a 
map on the space of modular forms of dimension k, M;: 


T(n)- f(r) =n F(T(n)P(,7)). 
Note the constant n*~! is just to make formulas nicer. 


Proposition 38.9.2: T(n) is a map M, — Mg, and restricts to a map on cusp forms 


b 
d 
n*-1 F(T(n)T(Ar, 1)) 
= n* FIT (n)(cer + d)T'(ar + b, cr + d)| 

n*-l(er + d) *F[T(n)I(ar +b, er + d)| F homogeneous, 
= (er +d) *n* 1 F(T (n)I(7, 1) (7,1) = (ar +-b,cr +d) 
er + d)-*T(n) - f(r). 


Proof. Let A = (: ) € SL2(Z). We have 


T(n) + f(Ar) 


In the first case we can take pth roots because p 1 |@,/%"|. 


536 


Number Theory, 838.10 


In the following subsections, we prove several key properties of the Hecke operator, and 
the Hecke algebra (the algebra generated by the T(n)). 


e The operators T(n) are multiplicative. 
e The Hecke algebra is commutative. 


e The Hecke operators (on modular forms) are self-adjoint with respect to the Petersson 
inner product. 


We will prove the last two items more generally, for a generalization of the Hecke operators, 
T, where a is a matrix. We will then compute the explicit action of T(n) on the Fourier 
coefficients of modular forms. The main application of Hecke operators is that we can 
diagonalize M;, with respect to the Hecke algebra; thus we can speak of ezgenfunctions in 
M,. Using the multiplicativity of T(n), we how that the coefficients of these eigenfunctions 
are multiplicative. 


9.1 Hecke operators on lattices 


Definition 38.9.3: Define R(n) :K — K by 


R(n)[A] = [nA]. 
Theorem 38.9.4 (Multiplicativity of Hecke operators, I): For any m,n, 
mn 
T(m)T(n)= SD dR(A)T (=) 


d|gcd(m,n), d>0 
In particular, the following hold. 
1. Ifm Ln, then 


2. If pis prime and r > 1 then 
T(p")T (p) = T(p"™"*) + pR(p)T(p"™"). 
Translating these properties to modular forms we get the following. 


Theorem 38.9.5 (Multiplicativity of Hecke operators, II): For any m,n, 
Timr(nf= SY d& 7 (=) f. 
dlgcd(m,n), d>0 
In particular, the following hold. 
1. Ifm Ln then 
T(m)T(n)f =T(mn)f. 
2. If pis prime and r > 1, 


T(p)T(p") =T(p"*") f +p 'T(p"")f. 
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10 Simultaneous Eigenforms 


Definition 38.10.1: A simultaneous eigenform is a modular form f that is an eigen- 
function for every Hecke operator T},. 


Write 


We know that 
(ag )) = > Yn(m)q”™ 


m>0 


mimj= S| dd’ *¢ () 


d|gcd(m,n) 


where 


To find properties/criteria for eigenfunctions f, we compare: 


f(r) 
(Tn f)(7) 


First, we consider the nonvanishing of c(1). Keep the above notation. 


c(0) + c(1)q+-::- (38.12) 
o,-1(n)c(0) +e(n)g+---. (38.13) 


Theorem 38.10.2 (Apostol, 6.14): Suppose k > 4 is even, and f € M, is a simultaneous 
eigenform. Then 


c(1) £0. 
Proof. Let A(n) denote the eigenvalue corresponding to f for T;,. From (38.12) and (38.13) 


we get 
en) =A )e(1). 


If c(1) = 0 then c(n) = 0 for all n, so f is a constant, contradiction. 


The previous theorem allows us to normalize a simultaneous eigenform so c(1) = 1. 


Theorem 38.10.3 (Simultaneous eigenforms have multiplicative coefficients): Suppose 


f(r) = do e(n)q” € Sp 


n>1 
with k > 12 even. Then the following are equivalent. 
1. f is a simultaneous normalized eigenform. 
2. For all m > n, 


emeni= > de (=) 


d|gcd(m,n) 
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Moreover, 
An) = en) 

Proof. Again from (38.12) and (38.13), if f is a simultaneous eigenform we have 
A(n) = c(n) 


Now A(n) f(7) = (Ti f)(7) is equivalent to 


e(nye(m) = Xne(m) =a(m)= ate (ME) 


d|gcd(m,n) 


for all m,n > 1. 


10.1 Examples 
We can use Theorem |38.10.3}to conclude the multiplicativity of the coefficients T(n) of A. 


Corollary 38.10.4: Write A(7T) = 0°) 7(n)q". Then 


Tint) = S- ds (=) : 


d|gcd(m,n) 
In particular, 


T(mn) = T(m)r(n) when m Ln 


Theorem 38.10.5 (Noncuspidal eigenforms): The only normalized simultaneous eigenform 


in Mo, = Sor is Hee Fog. 


Proof. The fact that ae FE, is a normalized simultaneous eigenform follows from Theo- 
rem (38.10.3). (The conditions there hold by simple calculation.) 
Suppose f(T) = Vinsoc(m)q™ is a normalized simultaneous eigenform. Use (38.12) 


and (38.13) to match coefficients in A(n)f(7) = (Tr f)(7). We get 


(1) eBY = on-1 (ney 
A(n)c(1) = c(n) 


So the only possibility is \(n) = o,~1(n), and this completely determines all the c(n) by the 
second equation above. (Then only one value of c(0) will work.) 
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11 Existence 
Theorem 38.11.1: There exists a basis of simultaneous eigenforms for Moz. 


Proof. Since we already have a simultaneous eigenform in Mo, — So, and dim(M2,) — 
dim(S ,) = 1, it suffices to show that there is a basis of simulatenous eigenforms for So,. 
We proceed in three steps. 


1. Define the Petersson inner product on 5S»; by 


(fo) =f reoate". 


(Here 7 = x 4+ yi.) It’s clear that this is positive definite. Note the following: 


(a) “* is the Haar measure with respect to SL2(Z) (it is invariant under the action 
of SL2(Z)). 


(b) f(r)g(r)y* is invariant under transformation by SL2(Z): Using 


Sh AN OO 
S(Ar) = jor +d 
we get 
f(Ar)g(At)(SAr)* = f(r)(er + d)*g(r)(er + d)-*§ —_S = f(r)gryy". 


ler + d|?* 


(c) The integral converges. Since f is cuspidal, f(t) = O(e7') = O(e-¥). Thus the 
integral is dominated by 


iE i Ce Yy*? dx dy < o. 


3 
2. The Hecke operators T;, are self-adjoint under this inner product, i.e. 
(Tf, 9) = (Ff, Tng) - 


(See pg. 82-86 of Brubaker’s notes|http://math.mit.edu/~brubaker/785notes. 


pdfl) 


3. We use the following linear algebra theorems. 


Theorem 38.11.2 (Spectral theorem): A self-adjoint linear operator on a finite- 
dimensional C-vector space has an orthogonal basis of eigenvectors (so is diagonal- 
izable). 
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Theorem 38.11.3: Let F be a commuting family of diagonalizable linear operators 
on a finite-dimensional vector space. Then F is simultaneously diagonalizable. 


Since the Hecke operators commute, the two theorems, combined with item 2, give the 
desired result. 
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Chapter 39 


Height functions 


Kob84], {Sil86], [HS00}, [Sil07 


1 Heights on projective space 


Let AK be a number field. We aim to define a function h on P"(A’) with the following 
properties. 


1. There is a bounded number of points with small height. 


2. The height encases nice arithmetical and geometrical information about the point, and 
behaves well under rational maps. 
It is natural to define the height in terms of the absolute values, or places, on K. The finite 
places will capture how divisible the coordinates of a point P are by various primes, while 
the infinite places captures the more geometrical notion of distance. We will thus define the 
height as a product over all places on K. 


Definition 39.1.1: Let K be a number field, and P = (2,...,2%,) € P"(K). Define the 
multiplicative height and logarithmic height of P to be 


Hx(P)= J] max{||xoll,,---> latnll,}"" 
veEeMK 
hx(P) = log Hx(P) = S> —nymin{v(xo),...,u(&n)} 
veEMK 


where n, = [K, : Q,]. (Recall that the normalized absolute value has ||2||,, = |a|?”.) 


Note that the value of Hx(P) is independent of the choice of homogeneous coordinates 
for P, because by the Product Formula ??]31.1| for any c € K* we have 


[J max{llexoll,,.--, lel. = IL lel, T] max{lleoll,..--, llenll. 


jews veMK veEMK 
= Il max [loll ssre~5 rll bs 
veMK 
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Note that for the case n = 1, we will often write Hx(x) to mean Hyx(1: x), and likewise 
for hx and the other height functions to be defined. 


Example 39.1.2: Suppose that P € P”(Q), and write P = (4p: ...: Z,) where ged(zo,...,2n) = 
1. Then 

A(P) = max{|Za\ 4,222 5 |@al | 
Indeed, for each prime p, one of Xo, ..., Zn is not divisible by p, so max{|Zo|p,.--,|n|p} = 1. 


The only factor that contributes is from the real place. 
For the special case n = 1, if ¢ is such that gcd(a,b) = 1, then we simply have 


H (5) = H(a: b) = max{|al, |b}. 


Proposition 39.1.3 (Elementary properties of height): 
1. Hx(P) > 1 for all Pe P"(K). 
2. If L/K is a finite extension, then 
H,(P) = Hx(P)*!, 


3. The action of the Galois group on P”(Q) leaves height invariant, i.e. for any o € 


G(Q/Q) and P € P"(Q), 


Proof. 
1. Scale the coordinates of P so that one of them equals 1. Then by definition, Hx(P) > 1. 


2. Use formula (??) by Lemma ??. 


3. The Galois group permutes the places. 


In light of item 2, we can define an absolute height on P”. 


Definition 39.1.4: Let P € P(Q). Let K be any finite extension of Q containing the 
coordinates of P. Define the absolute multiplicative/logarithmic height of P to be 


1 


H(P) = Hx(P)®2 


Define the field of definition of P = (xp: ---: 2) to be the smallest field K such that 
P € P(K). We have that 
x a. 
oP) = (2.2) 
vj Xj 


where j is any index such that x; 4 0. 


546 


Number Theory, §39. 1 


Theorem 39.1.5: For any B and D, the set 
{P € P"(Q): H(P) < Band [(Q(P):Q| < D} 


is finite. In particular, the number of points with height bounded by B in any fixed number 
field K is finite. 


Proof. Step 1: First note the theorem holds if we only consider points in Q, i.e. the set 
{P € P"(Q) : H(P) < B} 


is finite. Indeed, this follows from the characterization of the height on Q in Example}39.1.2 
and the fact that there are finitely many points in (ZN |—B, B])”. 


Step 2: Next, we reduce to the case n = 1, as follows. Choose coordinates of P so that x; = 1 
for some j. Then for any 7, we have 


H(P)= JT] max{llel,} > T[ max{ljeyll, 1} > H(e9. 
veéMg(P) —~7> véMg(P) 
Hence it suffices to show that 
{x €Q: H(z) < B and [Q{z] : Q| < D} (39.1) 


is finite. It will follow from this that there are finitely many choices for each x;, and hence 
a finite number of possibilities for P. 


Step 3: We would like to work with Q. To do so, we consider the minimal polynomial f of x. 
The lemma below shows that the height of the point formed from the coefficients is bounded 
in terms of the roots of the polynomial. A finite number of possibilities for f will mean a 
finite number of possibilities for x. 


Lemma 39.1.6: Let 
F(X) Sk” bag yk fe aS (X —11)---(X — 1a) — Q[X] 
be a monic polynomial of degree d. Then! 
d 
Hag: +++: @qa) < 2%" TJ H(r;). 
j=l 


Proof. We prove this by induction on d. The base case d = 1 holds by definition of H(qa). 
Suppose the lemma, proved for polynomials of degree d — 1. Let 


g(X) = hp4X? te thy (X —11)---(X — 1¢-1). 


‘A closely related quantity to the RHS is the Mahler measure of a polynomial, defined as M(f) = 
|aa| [Tj max(1, |xi]). 
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Then 
Oy = Tape HP Opts 


where for convenience b_, = 0. 


Let K be the field of definition for (ag : ... : a@,) and define 


Cy v € Mp. (i.e. v nonarchimedean) 


éy(m) := | (39.2) 


m, vu € M® (ie. v archimedean). 
By the triangle inequality, 


lazy < €y(2) max{|rabp|y, beso } 


< €,(2) max{|raly, 1} max{|bg|y, |bg—1|0}- 


Hence 


Max (laklv) S ev(2) max{|raly, 1} | max , |belv- 
Take the product over all v € Mx and noting that there are at most [AK : Q] archimedean 
places (since each corresponds to a real embedding or a pair of complex conjugate embed- 
dings), we get 

il max (|a,|,) < 22 il max {|bgly, L}. 


Pe aha a 
Raising each side to the power KO gives 
d 
Hg(ag tec @,) < 2H) AO,) < 2°" |] Ala) 


j=l 


where the last step follows from the induction hypothesis. 


Suppose z is in the set (39.1). Let f(X) = agX¢+--++ ao be the minimal polynomial 
of x, and 71,...,%q be the conjugates of x. Note d < D. Further noting that all conjugates 
of x have the same height (Proposition |39.1.3(3)), we have by the lemma that 


H(@g222.2 Gy) <2? T] Ha) So?“ a2? BP. 
jal 


This means all the coefficients a, have absolute value at most 2?-!'B?. This shows there 
are a finite number of possibilities for f and hence a finite number of possibilities for x. 


As a first application, we prove the following famous theorem of Kronecker. 


Theorem 39.1.7 (Kronecker): Suppose a € Q has all conjugates lying on the unit circle. 
Then qa is a root of unity. 
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Proof. First we show that H(a) = 1. To this end, let kK = Q(a). If v is a finite place of K, 
then |a|, = 1 since a is a unit. If v is an infinite place of K, then it is determined by a real 
or complex embedding, and |a|, = 1 by assumption. This proves our claim. 

It is easy to see from the definition of H that H(a”) = 1 for all n. Furthermore a” € Q(a) 
for each a. However, by Theorem there are a finite number of x € Q such that 
x € Q(a) and H(x) = 1. Hence a4 = a* for some j ¥ k, and a is a (k — j)th root of 
unity. 


Remark 39.1.8: It is informative to unravel the arguments leading to the theorem above. 
For each a3, we have that the minimal polynomial of f; has bounded degree; moreover it 
has bounded coefficients, simply because all conjugates of a? have absolute value 1. (This 
is essentially the argument in Theorem [39.1.5]) Hence there are a finite number of f;, and 
a! = a* for some j # k. 


2 Height functions and rational maps 
Next we consider how height transforms under rational functions. 


Theorem 39.2.1: Let ¢ : P” — P™ be a rational map over Q. Write ¢ = (fo,..-, fm); 
where the f; are homogeneous of degree d. Let Z = Z(fo,.-., fm), the subset of common 
zeros of the f; and D = P"(Q)\Z. Then 


h(¢(P)) < dh(P)+O(1) for all P € P"(Q). 


Moreover, if X is a closed variety contained in D (so ¢ defines a morphism X — P”), then 


h(@(P)) = dh(P)+O(1) for all P € X(Q). (39.3) 


In particular, if y is a morphism then h(¢(P)) = dh(P) + O(1) for all P € P(Q). 


Proof. Let K/Q be a finite extension contain the field of definition for @ and P. To obtain 
the upper bound on h(¢(P)) we calculate the valuations of the f;(P) and use the triangle 
inequality. Each f; can be written in the form 


file) = 3 aa 


e|=d 


Note there are cS terms in the above sum. Defining ¢,(t) as in (39.2), we get that by the 
triangle inequality that 


n+d : 
Joterle <eo(" 9) mata) ax (nl) 


and hence 


n+d d 
ax, cede <e0(" 4) masa) a (hel)! 
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Multiplying over all v € Mx, taking the [K : Q]th root, and noting that there are at most 
[AK : Q] archimedean valuations, we get 


n+d 


H(Q(P)) = HUHAP) 3 fol) < ("4 


) ((a.)\H(P 
where (a,) is the point with coordinates equal to the a.; note H((a,)) is a constant depending 
on ¢. Taking the logarithm gives the first part. 

For the second part, we will relate the height of P with the height of ¢(P) by writing 
powers of x; in terms of the f; by the Nullstellensatz. Let X = Z(g1,...,Qn‘). Since 
Z(fi,-- +5 fms Gis +++) Gm!) =X NZ = ¢, by the Nullstellensatz, 


iG Cen Ler eee | WA fissiey join sees One) = (Dig tisedep an) 


Hence there are polynomials pyi,..-,Pkm;Ukls+++>Ukm’ and e € N such that such that 


Peadi +++ + Pemdm + 9191 t+ +°* + demGm = Fy. 


By taking the terms of highest degree we may assume the p,; and q; are homogeneous. For 
any point P € X, we have g;(P) = 0 so the above becomes 


Pr(P)fi(P) + +++ + Pkm(P)fm(P) = xp. 


Let G be the point with coordinates equal to b where b is the coefficient of some px; OF dz,j- 
Since the px; have degree d, we see that |p.j(P)|v < |Glv maxi<j<n(|xj|v)°*. Taking the 
valuation and using the triangle inequality, 


Jel S o(m)|Gly max (larjlo)"* max (|Fj(P) lv). 
= max (|25jlv)" < eo(n)|Glo max (|fj(P) lo). 


Taking the product over all v € Mx and taking the [Kk : Qjth root gives 
H(P)* < mH(G)H(9(P)). 


Taking logarithms gives the desired result. 


This theorem has an immediate application to the dynamics of rational maps on number 
fields. Define a preperiodic point of a function f to be a point P such that there exist 
nen with OCP =e 


Theorem 39.2.2 (Northcott): Let ¢ : PY(K) > P%(K) be a morphism of degree d > 2 
over a number field kK’. Then the set PrePer(¢) C P (Kx) is of bounded height. 
In particular, the set of preperiodic points of ¢ in K is finite. 


Corollary 39.2.3: Let ¢ be a rational function on P!(K). There are a finite number of 
points P such that ¢”(P) = ¢"(P) for some m # n. 
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Proof. Theorem 39.2.1] gives us the lower bound 

h(o(Q)) = dh(Q) — C for all Q € PX(K). 
Suppose ¢™(P) = ¢™**(P). Then repeated application of the above gives 
h(o™(P)) = h™**(P)) > dh(g™**"1(P))—C > --- > d*h(g™(P)) —C(L+d+ 


Hence we get 


On the other hand, (39.4) also gives 
h(¢™(P)) > d™h(P) —-C(1+d+---+d™"). 


Putting these two bounds together gives 


C @ 
< < : 
WES a ae Pao =" 


The second part now follows from Theorem |39.1.5 


(39.4) 
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Chapter 40 


Diophantine approximation 


1 Approximation theorems 


Any real number can be approximated to an arbitrary degree by rational numbers. However, 
we would like these approximations to be “efficient,” that is, have good approximations 
without having denominators that are too large. Dirichlet’s theorem gives a measure of how 
well we can be guaranteed to do this. 


Theorem 40.1.1 (Dirichlet): Given a € R, there are infinitely many rational numbers 
. € Q such that 


In the other direction, it turns out that algebraic numbers cannot be approximated too 
closely by rationals. 


Theorem 40.1.2 (Liouville): ({) Let a € Q. There is a constant C := C(a) such that for 
every EQ, 
= 


—--—al> Zz 


| Pp 
q 


Q 


(Equivalently, for every < > 0, there are only finitely many : € Q such that F -- a & Ss.) 


Proof. Assume a ¢ Q. Let f be the minimal polynomial of a. 
Note that q" f (2) is a nonzero integer, so 


a(Ble Old 


On the other hand, by the Intermediate Value Theorem there exists x between : and a such 
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Pp 
eta! 


q 


1()|-(@)-a|=1 


Assuming F = a < 1, there is a constant C such that this is at most C 2 — a. Combining 
the above two inequalities gives 


> 
= Gqr 


=e 


| Pp 
q 


for all ® with z+, < 1, as needed. 
q 
In fact, Liouville’s Theorem can be made much stronger: d can be replaced by 2 + ¢€ for 
any ¢ > 0. This is the Thue-Siegel-Roth Theorem. We will state it for arbitrary number 
fields, keeping in mind that the case for Q is that described above. Recall that the natural 
measure of arithmetic complexity on K is the height function Hx (which in the case of Q is 
related to the numerator and denominator of the fraction). 


Theorem 40.1.3 (Thue-Siegel-Roth): Let AK be a number field, and a € K. For every C, 
there are only finitely many : € Q such that 


Remark on effectivity. 


2 Thue-Siegel-Roth Theorem 


Lemma 40.2.1 (Siegel’s lemma): For am x n matrix M let |M| = max ,<;<m |mi| 


l<j<n 
Suppose A € Matmyn(Z), with n > m. Let the row sums be 
A; = >) |aisl- 
j=l 
Then there exists a nonzero solution T = (t,,...,tn)’ of AT =0 such that 


IT] < (Cy-++ Cm) < (NIA). 


Proof. The key idea is to use the pigeonhole principle: Consider a set S' of T with |7T| small, 
say 


When 
[S|] >|{AT: Te S}], (40.1) 


504 


Number Theory, 840.3 


then there must be 7; and T, so that AT, = AT>, or A(T, — Tz) = 0. We can choose M 
large enough so that holds: because there are more unknowns than equations, the 
LHS grows faster in M. This value of M will give our bound. 

Let R; be the ith row of A. Note that fixing 7, 


( S- «| [El Se lS ( S- «| eat 
jlaig<0 jla;;>0 


so there are at most A; = |M]7_, |ai;| possibilities for R;,7’. Thus we have 
|S] = (M+ 1)" 
|{AT: Te S}| =(1+|M] Ai)---(14+ [MJ An) < Ar--+An(1 + | B])”. 


Taking M = (A,--:-A mn) ™m gives . As noted, using the Pigeonhole Principle gives the 
existence of T, and T> with AT; = oe pales the vector T; — T». 


3 S-unit equation 


Theorem 40.3.1 (S-unit equation): Let S C Mx be a finite set of places, and a,b € K™. 
Then the equation 
az + by=1 


has a finite number of solutions in S-units x,y € U(S)*. 


Proof. Let m be a large integer, to be chosen. Every solution is in the form x = aX™ 
and y = BY™ for a, 8 coset representatives in U(S)*/U(S)*™. There are a finite number 
of cosets since by Dirichlet’s S-unit theorem 2?[18.3.2] U(S) is finitely generated. Thus it 
suffices to show that each equation aaX™ + bGY™ = 1 has finitely many solutions. Let 
A=aa and B= bf. Then 

AX” BY” = 1. 


II € -) = _ 


cma 


Write this as 


where ¥ is a mth root of —8. 
Assume by way of contradiction that there are infinitely many solutions. We have 


I] 


(mal 


’ 


Y =|aal, 


we show that for some solution, this forces + to be too close to ¢y. Since Hx(Y) = 
Tl,eg max{1,|Y|?’}, we get |Y|, > Hx(Y) Bia for some v. (Why?) 
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Chapter 41 


Complex multiplication 


cn.cm In this chapter, we combine class field theory with the theory of elliptic curves, first 
to characterize the maximal abelian extension of K, then to illustrate the relationships in 
Section [29]7] for CM elliptic curves. We will assume basic facts about elliptic curves (for an 
introduction see Silverman Chapter II]]). 

We know that every elliptic curve over C has endomorphism ring either equal to Z or 
a quadratic order. In the second case, the elliptic curve is said to have complex multi- 
plication. This gives the elliptic curve a lot more structure. On one hand, it is useful 
algebraically—as we will see, torsion points of a CM elliptic curve give abelian extensions of 
imaginary quadratic fields. In general, because of the added structure, much more is known 
about CM elliptic curves than other elliptic curves, and they can act as a kind of “testing 
ground” or “first case” of general conjectures. 

On the other hand, CM elliptic curves have practical uses—for instance, if we take an CM 
elliptic curve corresponding to a specific endomorphism ring, we can easily compute its order. 
Hence we can generate an elliptic curve with near-prime order, useful in cryptography. This 
is much more efficient than generating random elliptic curves and using Schoof’s algorithm 
to find their orders. 

There are several big theorems about complex multiplication. In Section [2| we specialize 
our knowledge about the relationship between elliptic curves over C and complex tori to CM 
elliptic curves and build a toolbox of basic facts. However, since we are interested in number 
theory, we want to take curves defined over C and define them over Q instead—which we do 
in Section [8] Once we have these basics, we can then prove the big theorems. 

We suppose E’ has CM by a quadratic order @ C K (ie. End(£) = @), where K is a 
quadratic extension of Q. Then the following hold. 


1. The j-invariant j(£) generates the ring class field of @ over K. In particular, if 
OC = Ox, then j(F) generates the Hilbert class field of K, the maximal unramified 


abelian extension (Theorem |41.4.4): 


2. If E is defined over Hx, and we adjoin certain functions of torsion points of FE, then 
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1 


we get the maximal abelian extension of K (Theorem |41.5.4): 
K(j(£), h( Etors)) = ites 


Compare this with the Kronecker-Weber Theorem, which says the maximal abelian 
extension of Q is generated by roots of unity (torsion points of Q*). 


. j(E) is moreover an algebraic integer (We omit this; see Silverman AT, [Sil94] IT.6].) 


. The action of the idele class group sending K/a to K/x~‘a corresponds to the Galois 


action on the corresponding elliptic curves, where the Galois action is given by the 
Frobenius element of ¢. This is the Main Theorem of Complex Multiplication 
and plays an important part in taking moduli spaces initially defined only over C and 
defining them over algebraic number fields. 


. The L-series of a CM elliptic curve is particularly easy to understand, because it is a 


product of 2 Hecke L-series (Theorem |41.7.5). 


Two “big ideas” we’ll consistently see are the following. 


iP 


We expect abelian extensions because for CM elliptic curves (with endomorphism ring 
Ox, say), the image of the map G(L/Hx) @ Aut(E|m]}) commutes with Ox, not just 
Z and hence must be abelian, with appropriate L. 


. We can use torsion points E|m] to “keep book” on the action of Frobenius, in the same 


way that we used the roots of unity f,, to keep book on the action of Frobenius on 


G(Q(Hm)/Q). 


Elliptic curves over C 


The following theorem helps us understand elliptic curves over C. 


Theorem 41.1.1: tnmaatticecceoc Let go(A) = 60G4(A) and g3(A) = 140G¢(A), where G’, is 
the Eisenstein series. Let A be a lattice in C and ¢ be the associated Weierstrass g-function. 
There is a complex analytic isomorphism between the complex torus C/A and the elliptic 
curve over C, 


y? = 42° — go(A)ax — g3(A) 


given by 


®(z) = (w(z), #'(2)). 


The map ® gives an equivalence of categories between the following. 


I 


Objects: Complex tori C/A, where A is a lattice in C. 
Maps: Multiplication-by-a C/A; + C/A, where aA, C€ Ag. 
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2. Objects: Elliptic curves over C. 
Maps: Isogenies. 


Proof. Silverman [Sil86) VI.5.1.1, 5.3] 


The endomorphism ring of a lattice A C C is either Z or an imaginary quadratic order, 
so the same is true of an elliptic curve EF’ over C. If the endomorphism ring is a quadratic 
order @, we say E’ has complex multiplication by @. 


2 Complex multiplication over C 


sec:cm-C 


2.1 Embedding the endomorphism ring 


We know the endomorphism ring End(E£) of a CM elliptic curve corresponds to a quadratic 
order @ but since any quadratic order has conjugation as an isomorphism, we need to specify 
a way to embed End(£) into C. 


Example 41.2.1: ex-wnicn-i Consider the curve E : y? = 23+ 2. We note that the endomor- 
phisms 


bi(x,y) _ (=z, ty) 
a(x, y) = (=2, —ty) 
both square to —1. Which one should we call [i], multiplication by 7? 


Fortunately, we have a way of embedding End(A) into C, where A is the lattice corre- 
sponding to E, because A itself is in C. This to give a canonical way of embedding End(F) 
into C. 


Proposition 41.2.2:  pr:normatize-cmee Let E/C be a CM elliptic curve with complex multiplica- 
tion by @. There is a unique isomorphism [-] : @ = End(£) satisfying either of the following 
equivalent conditions. 


1. [a] is the unique morphism making the following diagram commute, where the top 
map is multiplication by a. 
C/A C/A 


° . » 


Ex —-z Ex 
2. For any invariant differential w € Ox, [a]*w = aw. 


Moreover, we have the following. 
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3. Define [-]; and |-]2 for elliptic curves FE, and Fy. For any morphism ¢: E,; > Fa, 
do |ali = [a)2 04. 
In other words, multiplication by @ commutes with all morphisms. 


4. For any o € Aut(C), 


i.e. it commutes with Galois action. 


The pair (F,|-]) is called a normalized elliptic curve. After we prove this proposition, 
we will assume all CM elliptic curves are normalized. 


Proof. The uniqueness and existence of |a] satisfying item 1 follows directly from the equiv- 
alence of categories (Theorem [41.1.1). 

Define [a] as in item 1. For any invariant differential w on E,, since ® is an analytic 
isomorphism, we can consider its pullback to C/A; it will be cdz for some c (The space 
of invariant differentials on C/A is 1-dimensional.) Clearly, m*(cdz) = cd(az) = acdz. 
Transferring this to the bottom row of the commutative diagram gives [a]*w = aw. For 
uniqueness, note the map 


Hom(£}, E42) erg Hom(Q»p,, 0) p, )eaieo-ditt-inj (41.1) 
o-¢ 


is injective when all isogenies EF, — E», are separable (in particular, in characteristic 0), i.e. 
the action of an isogeny of elliptic curves on an invariant differential completely determines 
the morphism. Taking £, = E> and considering the preimage of multiplication-by-a gives 
uniqueness in item 2. 

A simple diagram chase shows that (¢o|a]1)* and ([a]20¢)* act the same way on w € Qrg,. 
Then (41.1) gives item 3. 

The proof of item 4 is similar. 


Example 41.2.3: The definition using differentials is useful for calculations. Revisiting the 
above Example |41.2.1| we see that we should let 


[i] (x, y) = =e ty). 


Indeed, defining [¢] in this way, we check that 
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2.2 The class group parameterizes elliptic curves 
Let K be an imaginary quadratic field and @ an order inside K. 
Definition 41.2.4: Let L be a field. Define 


Ell; (@) = {elliptic curves E/L with End(£) = @} 
_ {elliptic curves E/L with End(E) = @} 
7 isomorphism over L 


Ell, (@) : 
ie. Ell;(@) is the set of elliptic curves over L whose endomorphism ring is @. If we omit L, 
we assume L = C. 


If £ € Ell(@), then its corresponding lattice A must be homothetic to a fractional ideal 
of @: indeed, we can scale the lattice so that 1 € A; then @ C Aso A C K; since it is a 
lattice it must be a fractional G-ideal. Now note an @-ideal a has endomorphism ring @ 
iff a is a proper ideal (see Definition [u7]i7.4.5). Hence we get a correspondence between 
isomorphism classes of elliptic curves [E] € Ell(@) and proper @-ideals up to homothety. 
However, two fractional ideals a and 6 are homothetic iff Aa = 6 for some 4, i.e. iff they 
are equivalent in the class group. Thus the class group of @ parameterizes all isomorphism 
classes of elliptic curves with endomorphism ring @. This is summarized in the following. 


__ {elliptic curves F/C with End(Z) = @} _ {proper fractional @-ideal} 
7 isomorphism over C 7 principal @-ideals 


Ell(@) = Cl(6). 


We state this as a theorem. 


Theorem 41.2.5: thm:cu=c1 We have a bijection 
EG) = Cl(@) 


where [E] € Ell(@) is sent to a [a], where a is a fractional ideal homothetic to the lattice 
corresponding to E. 


We get much more than this, however. Ell(@) is a priori just a set; however, Cl(@) is a 
group. We can define the action of [(@) on Ell(@) since [(@) acts on lattices. This action 
will descend to an action of Cl(@) on Ell(@), since isomorphic elliptic curves correspond to 
equivalent ideals. 


Theorem 41.2.6: There is a group action of Id(@) on EIl(@) given by 
akan = Ea-tx 


where FE’, denotes the elliptic curve corresponding to the lattice A. 
This descends to a simply transitive group action of Cl(@) on Ell(@). 


‘When R = @x, all ideals are proper, so this distinction is not important. The reader unfamiliar with 
non-maximal orders can take R = Ox throughout. 
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Proof. Just check that if A has endomorphism ring @, then so does the lattice a~'A. (Note 
that bL is defined by {sa:s € b,a € L}.) 

For the second part, note that E, & aE = E,-1, iff A and a~!A are homothetic, i.e. a is 
principal. 


Remark 41.2.7: Another way of saying that Cl(@) acts simply transitively on Ell(@) is 
that Ell(@) is a torsor or principal homogeneous space for Cl(@). 


This action will be fundamental to our understanding of CM elliptic curves. Later on we 
will relate this to the Galois action. The interplay between these two actions is the source 
for much of the richness of CM theory. 


2.3. Ideals define maps 


For any n € Z and any elliptic curve E, n defines the multiplication by n map [n] : E > 
E. When E has CM, we saw in Theorem that a € @ defines (canonically) the 
multiplication by a map [a] : E + E. We now extend this to ideals: if a is a proper O-ideal, 
a determines a “multiplication by a” map. The only difference is that [a] is now a map 
E> ak. 


Definition 41.2.8: Let E € Ell(@) correspond to the lattice A. Let a be a proper integral 
ideal of G@. We have aR C R, so a determines a map C/A > C/a!A, sending z ++ z. Define 
the multiplication by a-map as the corresponding map on elliptic curves 


[a]: B > Ey, = GE. 
Proposition 41.2.9: ):8j,) Do we need R= Ox? Let F € Ell(@x). We have the following. 
1. The kernel of [a] (the “a-torsion points”) is 


Ela) :={P € E: [a]P =0 for all a € a} = Ox/a. 


2. The degree of [a] is 
deg([a]) = |E[a]] = Ma), 


and in particular, deg({a]) = |E[a|| = Nmx (a). 


Proof. Silverman AT |Sil94| pg. 102-3]. 


3 Defining CM elliptic curves over Q 


sec:em-2 We show that we do not lose anything if we just consider elliptic curves over Q instead 
of over C. To do this, we look at the j-invariants. 
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Proposition 41.3.1: p:j-ai, Suppose F is an elliptic curve with CM by a quadratic order @. 
Then j(£) € Q, ie. j(E) is algebraic. 


Proof. Let a be any automorphism of C over Q. We look at how a acts on j(£). 

Note that FE’ is defined by taking any equation for FE’ and operating on all the coefficients 
of FE by o, so o(j(F)) = j(E”). 

First note that End(#) = End(E’) by the map ¢ + @¢”’. Hence End(o(E£)) = @ as 
well. But Cl(@) is finite, and as |Cl(@)| = |Ell(@)| (Theorem [41.2.5) we see that the E” 
lie in finitely many isomorphism classes. Because isomorphic elliptic curves have the same 
j-invariant, there are a finite number of possibilities for 7(F7). 

As {a(j(E)): 0 € Aut(C)} is finite, j(£) must be algebraic. 


This allows us to prove the following. 
Theorem 41.3.2: tnmci-cq We have 
Ellc(@) = Ellg(@). 
Proof. We use the following properties of the j-invariant. ({Sil86) IIT.1.4}) 
1. For every j € K, there exists an elliptic curve E/K with j(F) = j. 


2. Let K be an algebraically closed field and FE), EF», be elliptic curves defined over K. 
Then FE; © FE, over K iff j(£1) = j( £2). (The backwards direction does not necessarily 
hold if K is not algebraically closed.) 


We show that the map 
eqeliqc€ lg (@) —> Ellc(@) (41.2) 


is an isomorphism (of sets, in fact, of Cl(@)-modules). The map is well-defined, because any 
automorphism over Q is an automorphism over C. 

By Lemma [41.3.1] if [E] € Ellce(@) then j(F) € Q. By item 1, there exists an elliptic 
curve E’ defined over Q with j(E’) = j(£). Then E’ is isomorphic to E over C. Thus 
the map above is surjective. It is injective because if E, E’ are defined over Q and 
isomorphic over C, then item 2 says j(£) = j(£"); and the other direction of item 2 says 
that E & E’ over Q. 


It is also important to know what fields we can define elliptic curves and isogenies over. 


Proposition 41.3.3: Suppose F is an elliptic curve with CM by @ C Kk, where K is an 
imaginary quadratic field. 


1. If FE is defined over L then endomorphisms of F can be defined over LK. 


2. If E\, F, are defined over L then there exists a finite extension M/L, so that every 
isogeny E, — FE is defined over M. 
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Proof. For item 1, note that all endomorphisms are in the form [a] and use Proposi- 
tion ). 

For item 2, first we claim that any isogeny ¢ is defined over a finite extension of L. For 
any o € Aut(C) fixing L, 6? isa map FE, — E>» having the same degree as ¢. Any isogeny is 
determined by its kernel, up to automorphism of £; and Ey. As EF; has a finite number of 
subgroups of given index and deg(¢) = ker(@), there are finitely many isogenies of a given 
degree. Hence {67 : ¢ € G(C/L)} is finite, showing ¢ is defined over a finite extension of L. 

Now Hom(F), £2) is a finitely generated group, so we can take the field of definition for 
a finite set of generators. 


4 Hilbert class field 


4.1 Motivation: Class field theory for Q(¢,,) and Kronecker- Weber 


sec:kw-cm 


4.1.1 The case of Q 


First we give some motivation for the next two sections by making an analogy with class 
field theory for Q(¢,). We can think of fun, the nth roots of unity, as the analogue of E[n]: 
[in are the n-torsion points of the group variety Q* under multiplication, and E [n] are the 
n-torsion points of an elliptic curve. To emphasize this analogy, we write K™*|n] to denote 
the nth roots of unity in K. 

Recall how we established class field theory for Q(¢,): given a prime p, we want to find 
(p, Q(¢n)/Q). To do this we looked at the action of (p, Q(¢,)/Q) on Q*[n] = fn, by taking 
everything modulo p. We know by definition of (p, Q(¢,)/Q) how it must act on the residue 


field extension [/F, and hence on F*[n]. Suppose p {n. Because the maps 


Q*[n] > FE [n 
End(Q*[n]) + End (F% [n]eaena-in (41.3) 


are injective (the first is because p { n and the second is a direct consequence of the first), 
once we know how (p, Q(¢n)/Q) acts on F¥[n], we know it acts on Q*|n], so we know exactly 
what automorphism it is: 


(p, QM(Gn)/Q)(Gn) = Ce 


In particular, since ¢, is a n-torsion point (i.e. ¢” = 1) this only depends on p (mod n). 
Hence we get the Artin map Wo¢,)/@ factoring through the modulus oon:? 


Voten/@ : Lo/Ig(1, noo) = G(Q(¢,)/Q). 


?The oo is a technical detail coming from the fact that Q is totally real. 
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Finally, since every modulus divides con for some n, we get the Kronecker-Weber Theorem 


Q” = Q(Go) = Q(Q*[o0]). 
In summary, we found the ray class groups and thus the maximal abelian extension by 
looking at how (p, Q(¢n)/Q) acted on Q*[nJ: 


oe Or) satica Fp (41.4) 
© © 


Tp/Po(1, noo} 8G (Q(Gn)/Q) =» G(Fp(Gn) /Fp)- 


4.1.2 The case of kK 


One big difference when we’re working over an imaginary quadratic field K is that while 
we had Clg = 1, we have Clx is nontrivial in general. This corresponds to the fact that 
there is only 1 nonisomorphic “version” of G,,(Q) = Q*, but multiple elliptic curves with 
endomorphism ring by the same order @. Hence G(K*>/K) no longer operates on the same 
elliptic curve. Instead we have to analyze it in two steps. 
1. Consider the action of G(Hx/K) on Ellg(@), i-e. equivalence classes of elliptic curves 
with CM by @. 


2. Consider the action of G(K*?/Hx) on the torsion points Ejors of a single elliptic curve. 


In both cases, we will understand the action by looking at how the Frobenius elements of 
the Galois groups act. 


4.1.3. The case of K: Part 1 


We have two natural actions on the set of elliptic curve Ellg(@x), namely the action of 
G(K/K) and Cl(@x). Our first task is to relate these, i.e. find a dotted map that preserves 
the action on Ellg(@x): 


eq:G-cl-compat Ellg( Ox) (41.5) 


We'll see that this map factors through G(L/K) where L = K(j(£)). We have a map 
WL/K : Ti. /Px(1,f) + G(L/K); we show that f = 1 and the composition of the two maps is 
an isomorphism, and that in fact we have 


eq:G-cl-compat2 Ellg(@x) (41.6) 
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We establish (41.6) by looking at the reduction of the elliptic curves modulo some 9. 
Since G(Hx/K) = Cl(@x) this will show that L = Hx, the Hilbert class field of Kk. 


4.1.4 The case of kK: Part 2 


We can now do the same thing we did with Q, use the torsion points of elliptic curves to 
find the ray class fields and the maximal abelian extensions. We can’t work directly over 
K because Cl is nonzero, but if we imitate the argument (with some modifications) over 
Q for Hx we will get the ray class fields of K. We let L, = K(j(E), h(E[n])) where h is a 
Weber function (to be defined). 

Let I,, | be the residue fields of L, and Hx modulo some prime. We show L,, is the ray 
class field for (n) by constructing the diagram 


ages El] vey Bln] (41.7) 
© oO 


Nu /x(18,.)/ Px (1, ny G(Ln/ Hx) “=> G(ln/2). 


We now carry out these two parts. 


4.2 The Galois group and class group act compatibly 
We establish the map in (41.5). 


Theorem 41.4.1: tim:map-c-: There exists a map F': G(K/K) > Cl(@x) such that for any 
elliptic curve E, 
[E7] = F(o)E. 


This map factors through G(K?»/K). 


As a reminder, the action of Cl(@x) on Ellg(@x) is such that if EH = Ey, then F(o)E = 
ER(o)-11- Theorem [41.4.1] expresses a deep relationship because the left-hand side expresses 
an algebraic action, while the right-hand side expresses an analytic action, as it is defined 
on lattices and the map between F and C/A is inherently analytic. 

Proving this theorem essentially boils down to showing the Galois action commutes with 
the action on Cl(@x). 


Proposition 41.4.2: For all E, 


Proof. Suppose E corresponds to A, i.e. FE = C/AC. Then we have the exact sequence 


0O- ASCO ELS 0. 
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Then a£ corresponds to a~tA. Take a resolution for a: 


R™ AR" 54350. 


Take a “Hom product” and use the Snake Lemma. See [Sil94| IT.2.5]. 


Proof of Theorem|41.4.1} See |Sil94} 11.2.4]. 


4.3 Hilbert class field 


Before we proceed with finding the Hilbert class field, we need to show injectivity of the 
reduction map like in (41.3). 


Theorem 41.4.3: tim:cchomrea Suppose £, and EF» are elliptic curves defined over L with 
good reduction at $8. Then the reduction map 


Hom(E), Ey) > Hom(Fy, E2) 


is injective and preserves degrees. 


Proof. See Silverman AT |Sil94| pg. 124] (Also see Silverman’s errata). 


Eventually rewrite this to work for all orders. The main theorem of this section is the 
following. 


Theorem 41.4.4 (j(£) generates the Hilbert class field): tnm:j-gencrates-nitberr Let E’ be an elliptic 
curve with CM by @x. Then 


1. K(j(E)) = Hx, the Hilbert class field of K. 
2. G(K/K) acts transitively on the isomorphism classes of curves in Ell(@x). 


3. For any ideal a € Ix, 
[Evie ) = [al [E]. 


In particular, the action of Frobenius on the j-invariant is given by operating by [p] on 
the elliptic curve: 
[EO MK) — [p][E]. 


Proof. Step 1: First we show the following: There exists a finite set of primes S of Z such 
that for any p ¢ S that splits completely in K, p = pp, we have 


F((p, L/K)) = [p] € Cl(@x). 
This will show the dotted map in (41.6) is the identity for a large number of primes p. 
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We have the map |p] : E — pE. We show that this is “like” the pth power Frobenius 
map. To do this, we show that it is inseparable of degree p (this is why we needed p to be 
split)?, and then look at the j-invariants of the reduced maps modulo p. 

As Ellg(@x) = Ellc(@x) is finite, we can find a finite extension L/K and representatives 
E,,..., E;, of classes in Ellc(@x), that are defined over L. Let S be a set of primes containing 
the primes that satisfy one of the following conditions. 


1. p ramifies in L. (Primes that ramify always cause trouble.) 
2. E or some £; has bad reduction at some prime of L lying over p. 


3. u,p(Nmz/o(j(£i)—j(Ex))) 4 0 for some i # k. (‘This allows us to know what equivalence 
class an elliptic curve lies in, just by looking at its reduction modulo p.) 


Let A be the lattice such that E(C) = C/A, and let a be an integral ideal relatively prime 
to p such that ap = (qa) is principal (This exists by Corollary |15]15.2.5). By the equivalence 
of categories |41.1.1} the following maps on complex tori correspond to isogenies of elliptic 


curves: 


C/A 4 C/pA 4 C/pta ta Soc/a 


~ 


=| “9 <9 =|9 


~ 


w 


Let the composition of the top maps be f and the composition of the bottom maps be g. 
Let w be an invariant differential on EF. Then w’ = ®*w is an invariant differential on 
C/A. It is in the form cdz. The composition of the top maps is just multiplication by a, so 
f*w’ = aw’. By commutativity, we get g*w = aw as well. 
Let pg S and B|p|pin L, K, Q, respectively. Since EF has good reduction at $8, we 
can reduce the elliptic curves and maps modulo §¥ to get 


gw = aw =0 


since 8 | a. By a criterion for separability (g is separable iff g* does not act as 0 on Op), g 
is inseparable. Now 


deg(¢1) = Mp = p, 
deg(¢2) = Na L p, 
deg(3) = 1. 


An inseparable map must have degree divisible by p, and the composition of separable maps 
is separable, so ¢; must be inseparable. 


3If p is not split, one can still show the map is inseparable of degree p?, with some more work. 
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Any inseparable map factors through the Frobenius map: 


pE-factors-frob E —_ E©) (41.8) 


1 ——— 


ph. 


We have pdeg(e) = deg(@,) deg(e) = deg(d1) = p so deg(e) = 1. This shows ¢ is an 
isomorphism. 
Thus we have 
pE ~ Bl). 


Now by definition of the Frobenius element (it is the pth power map modulo ‘B), we have 
j(E) = j(E)? = 7(E)"4/) modulo $. Putting everything together, 
j(pE) = j(E) = j(HP4/) (mod ). 


But we chose p so that nonisomorphic curves have j-invariants that are not congruent mod- 
ulo p (item 3). Therefore, pL & E®+/*), This shows that the action of p is the same as the 
action of (p, L/K), ie. F((p, L/K)) = [pl]. 


Step 2: We show that F : G(k/K) > Cl(@x) has kernel equal to G(K/K(j(E))), and so 
factors through G(K(j(E))/K) @ Cl(@x). Indeed, 


ker(F) = {o: F(o)E = E} 
={o:E° =E} definition of 7 
= {ot ge) =4(8)} j parameterizes isomorphism classes 
= G(K/K(G(£))). 

We let L = K(j(£)). 

Step 3: Let f be the conductor of L/K. We extend Step 1 to all ideals a: for all a we have 

F((a, L/K)) = [a] € Cl(@x); 
in other words f = 1 and the following composition is the identity map. 


WL/K 
sce le | Pe—— GLE 3 Cw: (41.9) 


~ 


Given a € J/., there are infinitely many p € ee in the same class as a with degree 1 by 
Corollary |29}29.3.6} Choose such a prime p, that does not divide a prime in S. Note a,p 
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differ by an ideal in Px(1, f) so they have the same image by the Artin symbol. Step 1 shows 
that 
Step 1 
F(a, L/K)) = F((p, L/K)) °° [p] = [a]. 


In particular, for any principal ideal (a) € I}, we have F(((a),L/K)) = 1. However, by 
definition the conductor is the smallest p such that a = 1 (mod f) implies ((a), L/K) = 1, 
so we must have f = (1).4 Thus the map F : I/-/Px(1,f) ~ G(L/K) we had originally is 
actually just F : In /Px + G(L/K), and we get (41.9. 


Step 4: Since the conductor is divisible by exactly the ramifying primes, L/K is unramified, 
and L C Hx. On the other hand, the map Fo Wy/K : Ix /Pr — Cl(@x) is an isomorphism 
because F’owy/x is just the identity map. This gives [L : K] = | Cl(@x)| = [Hx : K]. Hence 
L = Hx. This shows item 1. 


Step 5: Item 3 now follows immediately, since we already showed BY£/«() = [a] and we now 
know L = Hx. Item 2 follows since the fact that the composition in is an isomorphism 
means the map F’: G(L/K) — Cl(@x) is surjective. Since F transfers the action of G(L/K) 
on Ellg(@x) to Cl(@x), and Cl(@x) acts simply transitively on Ellg(@x), we get that the 
same is true for G(L/K). 


5 Maximal abelian extension 


We next carry out part 2 of our outline in Section We construct the ray class fields for 
kK, then take their compositum to get the maximal abelian extension. 


Definition 41.5.1: Suppose EF has CM by an order in K, and FE is defined over Hx. A 
Weber function is an isomorphism h : E/ Aut(1) — P’ defined over Hx. (So if f : E > E’ 
is an automorphism, then h(P) = h(f(P)).) 


We can always fix a concrete Weber function. 
Example 41.5.2: The simplest Weber function is the following. If FE has the form 


y=o'+Ar+B, A,BEe Hx, 


then take 
(>, ABZ£0 
h(P) = 12, B=0 
(2°, Cat 


“Technically, we only have ((a), L/K) = 1 for (a) 1 f, and a priori ((a), L/K) is not defined for (a) L f. 
(We don’t know f = 1 yet.) The proper way to conclude f = (1) is transfer the problem over to ideles: We 
know wz/«(P).) = 1, so or/«(K*UR) = 1. By T/K(1,f)Ux(1,f) & Ix/K*Ux(1,f) we conclude that 
éijk(K*Ux) =1. Hence f = 1. 
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In the 3 cases, respectively, Aut(£) is 1, Z/2 or Z/4, and Z/3 or Z/6. 
We can define a Weber function that is “model independent,” i.e. doesn’t change under 
if we change to an isomorphic elliptic curve, by 


This is because the expressions have “weight 0.” 


The importance of the Weber function is given below. It would not be true if h(P) were 
just defined as h(x, y) = x. 


Lemma 41.5.3: tem:-ab-ext Let EF be an elliptic curve with CM by @. 
1. The extension K(j(F), Etors)/K(j(E)) is abelian. 
2. The extension K(j(E), h(Etors))/K is abelian. 


The first statement is important because it tells us G(A’/K(j(E))) acts in an abelian 
way on Fior5. Thus the “Galois representation” of the Galois group on Fors is abelian. Thus, 
as we will see, it will decompose into two Grossencharacters. 


Proof. We have an injective map G(K(j(E), E|m])/K(j(E))) © Aut(E[m]).° Now, the 
image of G in Aut(E|m]) commutes with Ox, so is contained in 


Auton /mox (E[m]) = Autox/mox(@x/MOx) = (Ox /mOx)” 


which is abelian. 

For the second, suppose 0,7 € G(K(j(E), h( Eters))/). We show that or = To. Since 
K(j(E))/K is abelian, oro~'!7~! fixes j(E). Now oro~'r~! gives an automorphism of 
E’ = To(E) because 

(ota '1~")r0(E) = or (E) = t0(E), 
as the Galois action factors through G(k@?/K) and hence is abelian (Theorem |41.4. 1) (alter- 
natively, because oto~7~! fixes j7(£)). As E is defined over Hx, we actually have equality. 

Since h is invariant under automorphism, for any P € Ejors, 


BP) = h(aro—'t'P) a ota ‘Tt 'h(P). 


(We know h is defined over Hx and oro~!r~! fixes Hx = K(j(E)).) Hence ora~!r~" fixes 
h( Ftors) as well, and oro—4771 = 1. 


°Since E[m] = Z/m x Z/m, if we choose a basis for E[m], we have Aut(E[m]) = GL2(Z/m), so we have 
a Galois representation. 
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Theorem 41.5.4: thm:maxabe-ext-x Suppose K is a quadratic imaginary field and F has CM by 
Ox. 


1. For an integral ideal a of Ox, Ly := Hx(h(E[a|)) = K(j(£), h(Ela])) is the ray class 
field of K modulo a. 


2. The maximal abelian extension of K is 


K(j(E), h(Etors))- 


Proof. Step 1: We need the following lemma. 


Lemma 41.5.5: tem:comm-in-image Suppose F is an elliptic curve defined over L with CM by @x, 
and has good reduction at $8. Let E be the reduction modulo 8. Let @ : End(#) — End(£) 
be the reduction map on endomorphisms. Then for any y € End(F), 


y €im(™) <> 7 commutes with every element in im(@). 


Proof. Since E has good reduction, the map End(EZ) @ End(£) in injective. Consider 2 
cases. 


1. End(£) is a quadratic order. Then End(E) = End(£) (as End(£) is a maximal order) 
so this case is clear. 


2. End(£) is an order in a quaternion algebra. Then End(£) @ Q is its own centralizer 
in the quaternion algebra End(E) ® Q, by the Double Centralizer Theorem |26]26.4.11 


Step 2: We show that in general, we can lift the Frobenius map. 


Proposition 41.5.6: prcciitt-ton Suppose EF has CM by @x and is defined over Hx. Let 
8] p|pin dx, K, Q, respectively, with p having degree 1 and p ¢ S, S being defined as in 
the proof of Theorem [41.4.4] Then the pth power Frobenius map can be lifted to a map on 
E, i.e. there is 4 making the following commute: 


Moreover, if F corresponds to the complex torus C/A, then up to isomorphism, corresponds 
to the map C/A > C/pA. (Recall that E®4%*/®) & pF by Theorem |41.4.4}) 
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Proof. We need to show ¢, is the reduction of some map; we do this by first reducing the 
problem to showing a certain endomorphism is in the image of @ and then showing the 
conditions of the previous lemma hold. 

Again we use (41.8): b1 a ec pE is “like” the Frobenius map. We know b1 is the 
reduction of a map, namely the map ¢; : E > pE. Now note pE & E®£/*) = EB), the first 
from Thm |41.4.4]and the second from definition of the Frobenius element. 

Let o = (p, L/k). It remains to show that «: E? + pE = E7 is the reduction of a map 
e', because then <’~' o ¢, will be the desired map. Let [a] € Aut(E7) be the reduction of 


a map |a|. To show ¢ commutes with [a], we consider ¢; = € 0 @,, and consider how |a 
Pp 


“commutes” with dy and @p. 
1. $1: By normalization (Proposition /41.2.2(3)), we know 
1 © [alg = [alge © oy. 


2. bp: Note that for any morphism of varieties f : V — W over a field of characteristic 
p, the following commutes, where ¢y, dw are the pth power Frobenius maps on V and 


W: 
V—-_4W § ¢dwof=fody. 
ov ow 
bia 


V(r) -~,w) 
Applying this to [a]z, 


ee eee 


dbp © [ale = [alk © bp = [a]ze © dy, 


where in the last step we used Theorem |41.2.2(4), noting o(a) = a since a € K and 
O€ G(HxK/K). 


Hence 


eo 


lal pr 0€ 0b) = € 0 by [alp 


g1 


2 £0 [alge 0 dp. 


Cancelling ¢, gives [a]ze 0€ = € 0 [a]ge, so Lemma |41.5.5] shows é is the reduction of some 
e’, as needed. 

To finish, note that ¢, does indeed correspond to C/A + C/p~tA. Hence \ corresponds 
to C/A + C/p-!A, up to some automorphism. 


Step 3: When (p, Hx /K) = 1, A is just an endomorphism of F, hence equals [a] for some a. 
In fact, the following proposition shows it is |7] for some 7 generating p, so that multiplication 
by a corresponds to the pth power Frobenius in the reduction. 
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Proposition 41.5.7: prpiistron Suppose EF has CM by @x and is defined over Hx. For all 
but finitely many degree 1 prime ideals p with (p, Hx/K) = 1 (equivalently, such that p is 
principal), there exists a unique 7 such that p = (7) and the following commutes. 


[7] 
— 


ere 
Proof. Since (p, Hx /K) = 1, Proposition |41.5.6] gives a diagram 


for some A. We know 4 is in the form [7], and show 7 satisfies the desired conditions. We 
have by Proposition |41.2.9| that 
Nmxjq(7) = deg([7]) = deg(¢) = p = Mp 


so either (7) = p or (7) = p. As always, when we’re deciding between conjugates, normaliza- 
tion comes to the rescue. Take w € Q¢ whose reduction modulo 8 is nonzero. Normalization 
says that [z]*w = mw so 


ai = [nr] G = 6° =0, 
the last step since the Frobenius map is inseparable. We get 5B | 7, forcing (7) = p. 
For uniqueness, note the map 


6x —, End(E) 25 End(E) 


is injective for E having good reduction at $8 (Theorem |41.4.3). 


Step 4: Consider (41.7). We need to show that Px(1,a) is exactly the kernel of the Artin 
map wr,/K- Note that Px(1,a) and ker(w,,/«) are both subgroups of Pt = ker(Wy,/K) = 
ker(wWr,/K(¢)|#,,)- It suffices to show that for all but finitely many primes p of degree 1 such 
that (p,Hx/K) = 1, we have p € Px(1,a) iff p € ker(wz,/x). 

Let p satisfy the conditions of Proposition Since the reduction of wWr/x«(p) is 
the Frobenius map, we get that Wp/«(p) = [a], for some 7 such that (7) = p.° Since 
(p, Hx/K) = 1, we have the commutative diagram 


oe (41.10) 


Ll 


E45. 


Note the analogy with the cyclotomic case. w,/x(p) acts on torsion points as [7], just as in the cyclotomic 
case it acted as the pth power map, that corresponds to [p] if we consider the natural map Z > End(Q(¢,,)). 
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We have the following string of equivalences, for all but finitely many degree 1 primes p 
with (p,Hx/K) =1, 


1. p € Px(1,a). 

2. p = (a) where 7 = ua where wu is a unit and a=1 (mod a). 
3. For all a-torsion points P € Ela], h([z]P) = A(P). 

3/. For all a-torsion points P € Efal, h((]P) =h(P). 

4. (p, £,/K) fixes h(E[a]). 

5. p € ker(z,/x): 


(1) ==> (2) is clear. 
For (2) = > (3), note that for all a torsion points P € E|al, 


h([7]P) = h([ulla]P) 
= h([a]P) his Aut(£)-invariant 
=P) a=1 (mod a) and Pe E{al. 


Note it is important that h be Aut(£)-invariant. _ 

For (3’) => (2), let P € Ela] be a torsion point. By VIIL.3.1b], Ela] > Ea] is 
injective for p{a and E with good reduction at p. Since / is an isomorphism (in particular, 
an injection) E/Aut(#) — P', we get that [z]P = [u]P for some [u] € Aut(#). But 
E\a] = @x/a, so we can choose u such that 7 = u (mod a). Then there exists a such that 
m7 = ua, with a= 1 (mod a). 

For (3) ==> (4), we calculate the action of (p, L/A) on a torsion point P € E{al, in the 
reduced curve: 


POLE = by(B) = IP, 


the second equality from Proposition |41.5.7} This allows us to understand the action on the 
nonreduced curve, since Ea] ~ E|a] is injective for p {a and p of good reduction. We get 


POLO — [a] P. 
Thus (3) implies 
h(P)\eL9 — p( Pet!) (p, L/K) fixes Hx and E defined over Hx 
= h([x]|7) 
= A(T) by (3). 


Now we prove (4) => (3/). Let o € G(K/K) be an automorphism such that o| ja» = 
(p, K*>/K). Then for any P € Efal, 


i((n)P) = i(o(P)) = h(P*) = A(P)” = h(P), 
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the last two equalities since o|4 = 1, h is defined over H, and o|,, fixes h(E|a]) by assump- 
tion. Thus (3’) holds. 

Now (4) <=> (5) comes from the fact that (p, La, K) already fixes K(j(F)), so to fix Lg 
it only needs to fix h(E|a}). 


Step 7: The maximal abelian extension is the union of the all ray class fields. Note every ¢ 
divides n for some n so we can just restrict to ray class fields corresponding to (n) for some 
neN: 


K*® = K(j(B), h(E[n))) = K(j(B), h(Erors)): 


6 The Main Theorem of Complex Multiplication 


Given o € Aut(C/K), consider the map 0 : E(C) > E?(C). We would like to know how this 
map acts on torsion points. This is since to get Galois representations of elliptic curves, we 
look at how o acts on torsion points—often specializing to torsion points that are a power 
of a prime. 

Because we are considering CM elliptic curves, we can identify the torsion points with 
K/a, for some ideal a. Namely, given an analytic isomorphism f : C/a = E(C), we can 
restrict it to K’/a to get 

Fleet Kia Lea = £(C) 

The main theorem of complex multiplication tells us we can transfer the map 0 : E(C) > 
E?(C) via an analytic isomorphism to a multiplication-by-an-idele map [x~'] : K/a > 
K/x~'a, where x and o are related in terms of the Artin map (to be made precise). 


Definition 41.6.1: Let x = Ipeve pl) Toeve v™’) € Ix be an idele. Let a be an ideal, 
and define xa by 


xa = p(x)a= ( Il pe) a. 


peV 


Define the map 
eq:mult-idele-map|X| H K/a — K/xa (41.11) 


as follows. Note K/a = J], Ky/aKky by the Chinese Remainder Theorem, where «x is just 
identified with its images in the Ay/aly: (xp)pevo. Then (41.11) sends 


eq:mult-by-idele( Gp ) > (LpQp) where x = (Lp). (41.12) 


Theorem 41.6.2 (Main Theorem of Complex Multiplication): tim:nt-om Suppose EF is an 
elliptic curve with CM by @x. Let o € Aut(C/K) and x € Ix be such that 


O| Kab = OK(X). 
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Fix an analytic isomorphism f : C/a — E(C). Then there exists a unique analytic isomor- 
phism f’: K/x7'a > E?(C) such that the following commutes: 


K/a—> K/x"a 


| 


E(C) —%4 E°(C). 


Remark 41.6.3: The map (41.12) can be a bit weird to think about: For instance, consider 
the simpler case K = Q, a= Z. Take the idele x with 1’s everywhere except 75 = 2. Then 


x] sends } 4 4,4 + 1,2 > 2} and so forth but sends + > 2. So it is surprising that 
2 225 a2 7 7 


5 5 
x! : K/a + K/x7'a can be related analytically to E(C) — E?(C). 


Compare this theorem to Proposition |41.5.7| Rather tan just dealing with the Frobenius 
element of a prime, we deal with the Artin map of an idele. 


Proof. Note uniqueness follows from the fact that topologically, the closure of K/x7'a is 
C/x~ta, and any continuous function is determined by its values on a dense set. 


First we prove this for E defined over Q(j(£)) and a integral. We do this in 2 steps. 
Step 1: Approximate o by a field automorphism \ that is the Frobenius element of a prime 
p. (The Frobenius element is something much more concrete to work with than the abstract 
Artin map of an idele.) We will take better and better approximations, which determine the 
action on E|m|] for larger and larger m, and take an inverse limit. 

So let Li, be the Galois closure of K(j(£), E|[m])/k. By Corollary ??.??, there are 
infinitely many primes with $8 | pin AK and L such that 


(B,L/K)=oeln,, Wp) =1. 


We can furthermore choose p satisfying the following, because each condition excludes only 
finitely many primes. 


1. p is unramified in L/,. 
2. p € S, where S is defined as in the proof of Theorem |41.4.4 
oe prin. 


By Proposition |41.5.6| there exists a map A: & — E’ that reduces to ¢, modulo $B. On 
E|m], both and a act as d». Because $B { m by item 3, the reduction map modulo 9, 
E|m] > Elm], is injective. Hence » and a act the same on E[m|: 


Elm) = 0|B{m) : Elm] > E°[m]. (41.13) 


SailacetA 
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But we know how the map \ acts: Proposition |41.5.6| tells us that the map \: E > E? 
corresponds to the map on complex tori i: C/a + C/p~'a.” Hence we have the commutative 
diagram 
eq:mt-cm-1 C/a ti C/p-ta (41.14) 
|) |r 
E(C) +4 E°(C) 
for some analytic isomorphism f”. 
Step 2: By Theorem /41.5.4| the ray class group modulo m is K, = K(j(E), h(E|m])). Note 


Ky, C Li. Now by assumption, p was chosen so that the images of p and x under the Artin 
map both project to o|x,,: 


PKm/K(X) = O Km = VKin/K(P) = OKin/K (ip(™)) 


where w, ¢ denote the Artin map on ideals and on ideles, respectively, and z is the uniformizer 
of p in Ky. We have 

ker WKim/K = K*UxK(1, m). 
(See Definition [24]24.5.8] for notation.) This follows from the definition of the ray class field 
and from the correspondence between ray class groups in Definition [24]24.4.5] and idele class 
groups in Example We have x € ip(71) ker Ox,,/K, giving 


x=a-i(7)-u, aeK*, uc€UxK(l,m). 
We now compose (41.14) with the homothety a~!, and note (x) = (a)p, to get the desired 
map C/x~'a > E?(C): 


dome Cla—"sC ip 4 0 6 (41.15) 


bo ee 


E(C) + E?(C) 


Here, f(2) =f" (az): 

This isn’t quite what we want yet, though, because the top row is the map a~ 
than the map x~'. We need to show that for m-torsion points, a~! 
Then we would have 


o(f(t)) =A) = flo") = fn(x"t), te m™*a/a. 


The first equality is since a, A were by construction the same on E[m| (41.13), so a0 f and 
Ao f are the same on m~'a/a. The second is by commutativity of (41.15). 


! rather 


acts the same as xt. 


The map o and x~! appearing in the theorem statement are bijections, while \ and i are not. This is 
okay, though, because we only use 4,7 to approximate o on m-torsion, and 4,7 are injective on m-torsion, 
since $B { m. 
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To show the third equality, we note that 


1 (ort = ie 2) for allt € m“'a/a 


(fi, bijective) << sa 't—-x tea for allt € m~'a 
<= a'ty - coat Eq for allt € m~'a, q 
(multiplying by 74 = a[t,(7)]quq) => [tp(7)]quqt — t € ay for allt € m~'a, 
= (ltp(T)] qq — 1) dq C may 
Ug € UK(1,m) ==> ([tp(7)]q — Lag © mag. 


Consider 2 cases. 
1. qAp. In this case, [i,(7)|, = 1, so this is trivial. 
2. q=ph: [ép(7)|, = 7, and 7—1isaunit. By assumption p { m. hence (t—1)a = a= ma. 


Step 3: We now show that the maps f’, are all actually the same for m > 3. Indeed, 
finleim) = fimnle[m) by construction, so fi, fy,» differ by an automorphism that fixes E[m]. 

This automorphism must be [¢] for some element of norm 1 in A, and f/, = [¢]o fi,,,. Since 
Ty dion ave isomorphisms, this says 


E{m] C ker[1 — ¢] 


The only possibilities are ¢ a 4th or 6th root of unity, and if ¢ 4 1, then [1 — ¢] has norm at 
most 4. Sovtor¢ 3, ¢ =, and ff: 


Step 4: Finally, we show the theorem holds for general E/L. Any elliptic curve F has a 
model E” defined over M’ = Q(j(F)), corresponding to a complex torus C/a’ with a’ an 
integral ideal (see the left face below). Let E — E’ be an isomorphism and K’/a > K/a' be 
the corresponding map on torsion. Then the existence of fi, for E’/L gives the existence of 
fi, for E/L, by choosing fi, to make the below diagram commute. 


-1 


K/a * K/x7'a 
fr K/a Fh K/x7ta’ 
E(C) “Tie E?(C) o 


a 
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6.1 The associated Groéssencharacter 


The Main Theorem involved 2 different elliptic curves, and 2 different analytic isomorphisms. 
In the special case that o fixes E’, the curves will be the same, and by nudging the map 
upstairs by a constant depending on x, we can restate the theorem using a consistent choice 
of f. (Compare to how we specialized from Proposition to [41.5.7]) The action of 
@1(x) on the elliptic curve will “essentially” correspond to multiplication by yg/, on K/a. 


Theorem 41.6.4 (Gréssencharacter of an elliptic curve): timgrossenec Let E/L be an elliptic 
curve with complex multiplication by @x, and suppose kK C L. Let x € I, and y = 
Nmz/«(x) € Ix. Then there exists a unique a = ag/,(x) € K™ with the following properties. 


1. oO = (y). 


2. For any fractional ideal a C K and any analytic isomorphism f : C/a > E(C), the 
following commutes. 


K/a——> K/a 
aby L(x) ab 
E(L*>) 2°53 R(t), 


Moreover, defining xz/z : I, + C* by 


XB/L(X) = @py1(x)[Nmz/K (x "Joo, 
Xe/x is a Grossencharacter of K’, and xg, is ramified at B (ie. ye/_(Us) is not identically 
1) iff EF has bad reduction at $B. 


Proof. Part 1: Since f is an isomorphism, uniqueness is clear. To construct a, choose any 
o € Aut(C/ZL) such that o|pa» = d, (x). We use Theorem |41.6.2| with o and y € Ix, noting 
the following points. 


1. E° = E since E is defined over LE and a fixes L. 
2. The image of f is contained in E(L?) as Eto € E(L?”) by Lemma 41.5.3] 
3. By compatibility of the Artin map, ¢,(X)|«a» = @«(Nmz/K X) = $x(y). 
We obtain an analytic map f’ making the following commute. 
K/a—_> K/y—a 
boo 
E(L*®) 223 E(1°). 
Because 


C/y'a = E°(C) = E(C) = C/a, 
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we have that y~‘a is homothetic to a, i.e. there exists 6 so that 6 takes K/y~‘a back to 
K/a. Defining f’(x) = f’(8~'x), we have that it differs from f by some automorphism [¢): 
fol¢] =f”. Let a = 6¢. Then we can extend the above diagram as follows. 


-1 
K/a——> K/y“'a “=> K/a 


Fe ee 


E(L?°) oi (x) ELL) 


As ay~'a =a, we get (a) = (y). 

To see that a is independent of f and the ideal a, let f’ be another analytic isomorphism 
K/a — E(L*). Let the map K/a’ > K/a be multiplication-by-y. Then f(7z) is also an 
analytic isomorphism K/a’ > E(L*»). Hence y~'f~! o f’ is an automorphism [¢] of K/a’, 
ie. f'(x) = f([Clyx). Thus ¢,(x)[f’(2)] = f’(ay—'z) as well. 


Part 2: ag/z, and hence yg; is a homomorphism since it’s clear that ¢,(xx’) o f = fo 
aa'yy’*, and $,(x"")o f = foaty. 

We need to check that xg/,(L*) = 1 and that y#/z factors through a modulus. 

For the first point, note ¢,(L*) = 1, the identity element of G(L?>/L). Let i: K* > Ix, 
L* —+ I, be the diagonal maps, and suppose x = i(x). We have y = Nmz;x(i(x)) = 
i(Nmy/x(x)). Then a is just the element such that a Nmy/x(x)~' induces the identity map, 
ie. a = Nmpyx(z) = [Nmzx Xoo, 80 Xz/i(X) = 1. 

For the second point, fix m > 3 (m = 3 works fine). We’ll show that for any idele x ina 
small enough open subset of finite index, @r(x) acts just like multiplication by ag/,(x) and 
fixes E|m], without the extra Nmz/«(X)oo factor, so that a will actually be 1. 

Let B,, be the kernel of the Artin map I, > G(L(E|m])/Z) (abelian by Lemma 41.5.3), 
so that it induces an isomorphism 


catEmPL(Bfm))/L : lt /Bm — G(L(E[m])/L). (41.16) 


We show that 
Um = Bm mM" if (Ninz i Ux(1, m)) = ker XE/L: 


This is of finite index in I, since B,, is open of finite index in I, and K*Ux(1,m) is open 
of finite index in Ix. 

Fixing an analytic isomorphism f : C/a — E(C), we get that for any t € m~ta/a and 
any x € Up, f(t) € Elm] so 


f(t) = fe by (E116) and x € By 


= f(aNmz;x(x)7't) by the Main Theorem |41.6.2 
= fot) t€m‘a/aand Nmz/x(x)p =1 (mod m@x,) for all p. 


Thus multiplication by a fixes m7'a/a, ie. a = 1 (mod m@x). Note Nmp/x(x)7* € 
Ux(1,m), SO 
(a) = (y) = (Nmz/x(x)) = Ox 
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and a is a unit. Together with a = 1 (mod m@x), we get a = 1.8 


Part 3: The relationship between ramification and bad reduction hinges on the Néron-Ogg- 
Shafarevich Criterion. See [Sil94) pg. 169-170]. 


Note that if yg/z, is unramified at B, then xg/r(ig(Ug)) = 1, so it makes sense to talk 
about Ya/1(%8) (defined as x ~z/_(ip(7)) for any uniformizer 7). 


Proposition 41.6.5: Let E/L be an elliptic curve with CM by @x, with kK C L. Let PB 
be a prime of L of good reduction, let EK be the reduction of F modulo $B. Let ¢@y be the 
Frobenius on F. Then the following commutes. 


E xe/L(®)] E 
L i. ot 
Ee —— > E 
Proof. Let m be a uniformizer of Ly, and let @ = ig(m). Note that a = 1. Hence 


Nmz/K(@)oo = 1, giving 
Xe/L OB) = Xx/1(@) = an/1(@). 


If m is an integer such that P% { m, then Nm; /«(@) fixes m~‘a/a (since it is 1 at all Q with 
Q | m). Then 


f(t)? = f([oz/r(@)| Nmz/x(@w)"t) definition of apy, 
= f ([xe/1B)] Nmz/x(@)~*t) 
= [ve/z(B)|f (Nmz/x(w)7"t) f preserves the action of Ox 
= [xe (B)| FO) Nmy/x(@) fixes m7'a/a. 


Modulo 8, ¢;(@) is just the gth power Frobenius map, so we get 


93] Fito = [ve/c(B)] leon: 


Since an isogeny is determined by its action on E[m] for m — oo (the kernel of a nonzero 
isogeny is finite), we get that this is true for F, not just E[m], as needed. 


To study the Galois representation G(K/Hx) + Aut Eis, of E, we reduce modulo a 
prime $8 of L, and show that on this reduced curve, the gth power Frobenius acts exactly as 
multiplication by the Grdssencharacter. In particular, the gth power Frobenius is represented 
by multiplication by xg/~ (8) when we think of Eyors as K/a. Thinking of Eyors as a 2- 
dimensional space Q?, this says exactly that the eigenvalues of the Frobenius acting on Exors 
is exactly ye/i (8) and Xp/,(P). Typically we just restrict our attention to ¢-power torsion 
points for some @. 


SAny number in the form m7 + 1, 7 € @x with norm 1 has norm at least (Nmx /g(m) — 1)? — 1, by the 
triangle inequality. In order for it to have norm 1, 7 = 0. 
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7 L-series of CM elliptic curve 


sec:]-series-cmec 


7.1 Defining the L-function 


We define the L-series of an elliptic curve as the L-series of the corresponding Galois repre- 
sentation. 


Definition 41.7.1: a1: Let F be an elliptic curve defined over K, and p, the associated 
Galois representation G(K/K) > Aut VE & GLo(Q/). 

Define the local L-factor of F at a prime p of K as follows. Choose @ such that p { Z, 
and let 


Ly(E, s) := Lp(pe, 8) = det(1 — q~* Frob(p)|(VE)”)*, 


where g = Mp and J, is the inertia subgroup of G(A’/K). (Choose an embedding Q; > C.) 
The L-series of F’ is the product of local factors 


L,(E£/K, s) : plex (E, s). 


Remark 41.7.2: This is (almost) the same as saying: fix a prime @ and let L(E/K,s) := 
L(p¢, 8). The only difference is that we run into trouble with the local factor Ly(p,, s) on 
the right hand side, so we have to choose a different ¢ and let this local factor be Ly(pe, s) 
instead. 


The following is an equivalent definition (that is more concrete). 


Definition 41.7.3: a:e12 Let N be the conductor? of the elliptic curve E. Define the local 
L-factor by 


( 
na —2s . 1, mLlN 
Ly(B,8)=1- a9 +x(gaq*, ag=9+1-|E(F)|, x(@) = 
0, else 
where g = Mp. Thus 
f —agq-*+aqq-7%, good reduction 
ie) = Lg, split ERP EeCueHOn | 
ll+q°, non-split multiplicative reduction 
il. additive reduction. 


°N is divisible by exactly the primes of bad reduction 
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? 


Note that a,, the “trace of Frobenius,” is related to the number of points of E over F 
Hence the L-function contains information about the number of points of E’ over each F 


q: 


- 

Showing that these two definitions are equivalent requires us to show that (V,E)’* is 2, 
1, or 0-dimensional when F has good, multiplicative, and additive reduction, respectively. 
The general idea is that the action of J, on VE contains exactly the information lost by 
looking at the reduced elliptic curve, since I, is exactly the kernel of D,(K/K) + G(k/k), 
so nontrivial action of J, corresponds to bad reduction. 

In the CM case, we cannot have multiplicative reduction, so the L-series is particularly 
simple. We will show that the two definitions are equivalent in this case. 


Theorem 41.7.4: tnm:cmec:no-mr Let E/K be a CM elliptic curve. Then £ cannot have mul- 
tiplicative reduction at any prime. 


Proof. An elliptic curve E has potential good reduction iff its j-invariant is integral [Sil86) 
VII.5.5}. CM have integral j-invariants, so have potential good reduction, i.e. have good or 
multiplicative reduction. 


Proof that Definitions|41.7.1| and are equivalent in the CM case. Suppose E has CM 
by an order @ in K, and E is defined over L. By Néron-Ogg-Shafarevich, J, acts trivially 
on V;E iff EF has good reduction at p. Let q = Mp. 


In the case of good reduction we need to show det(1—q~* Frob(p)|VeE) = 1—a,a~*+qq-**. 
Every endomorphism ¢ on E satisfies ¢? — tr(¢)¢ + deg(¢) = 0, where tr(¢) = 14+ deg(d) — 
deg(1 — ¢). Since Frob(p) acts as the Frobenius morphism @yp, its characteristic polynomial 
is 


det(A — Frob(p)) = A? — tr(@p)A + deg(dy,). 


But 


deg(p) = 4 
tr(dp) = 1 + deg(¢y) — deg(1 — oy) 
=q+1-—ker(1— ¢,) 
=q+1-|E(F,)|. 


(This part of the proof doesn’t use the fact that E has CM.) 

Since F has no multiplicative reduction by Theorem it remains to prove that 
W := (VE) = 0 when E has multiplicative reduction. We know by Néron-Ogg-Shafarevich 
that dim(W) < 1. But because & is CM, VE = (lim &~"a/a) ® Q has the structure of a 
Ox ® Q-vector space. If a € W, then for any a € 7 aa € W because [a] commutes with 
the Galois action. Hence W is not just a Qy-subspace of V, but also a Ox ® Qy-subspace. 
Hence its dimension over Q, is even, and must be 0. 
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7.2 Analytic continuation 


Theorem 41.7.5 (Deuring): timcmeci Let E/L be an elliptic curve with CM by @x with 
K CL.Then 


L(E/L, s) = L(s, a1) L(s, Wz/z)- 


Corollary 41.7.6 (Analytic continuation of Z-function for CM elliptic curves):  corcmectac 
Let F/L be an elliptic curve with CM by @x. Then L admits an analytic continuation to 
C and satisfies a functional equation relating its values at s and 2 — s. 


This theorem for general elliptic curves is very deep (it follows from the Modularity 
Theorem and the analytic properties of L-functions associated to modular forms). 


Proof of Theorem By Theorem |41.7.4| E& has no multiplicative reduction. Let $B be 


a prime, and consider 2 cases. 


1. EF has good reduction at 8. Choose any @ not dividing $8. The characteristic poly- 
nomial of the action of dy on VeE is det(A — dy|VeE). However, if we make the 
identification Fyo.; = K/a, we have 


VE = lim ("a /a, 


and we know that ¢y acts on Ey. = K/a as multiplication by xz/_ (9B). Therefore, 
the eigenvalues of the action of ¢y on VeE are just xe/_ ($8) and Xp, (4B), and 


det(A — dp|VeE) = (A — xe/i(B))(A — Xai (PB). 
Taking \ = p® and dividing by p?* gives 
Ip(E/L,s) = det(1— p °oy|VeE) = Lys, Xeyt) L(8, Xe/1): 
2. E has bad reduction at $B. Then yz/7(B) = 0 by definition, and Ly(L/L,s) = 1 = 
(1 — xe/x(%))1 — Xeyr(B)) = Ll, Xe/t)L(8, Xx/z)- 
Multiplying together all the local factors gives the result. 


Proof of Corollary The L-functions of Gréssencharacters have analytic continuation 
(Theorem |29]29.7.8, which works for Gréssencharacters as well). Thus the result follows 
directly from Theorem |41.7.5 


Thus we have carried out the program in Section for CM elliptic curves, to get the 
correspondences. 


(CM Elliptic curves) + (Galois representation) + (2 Gréssencharacters) 


Remember Grossencharacters are 1-dimensional automorphic representations. If we wanted 
a modular form, we can use the technique of automorphic induction to construct a modular 
form from 2 Grossencharacters. 
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Part VIII 


Arithmetic Dynamics 
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Chapter 42 


Local dynamics: Good reduction 


In order to study the dynamics of rational maps on Q or a global number field, we reduce 
it modulo various primes to rational maps on local fields, and then piece the information 
together to get information about our original system. 


1 Nonarchimedean chordal metric 


Inspired by the chordal metric on the projective line P!(C) 


o((x1 , y1| [29 ; yo]) = |21Y2 — £2Y1| 
Vlei? + lyl?/le2l? + ly2l? 


giving P!(C) the topology of the Riemann sphere, we define for a nonarchimedean valuation 
v the following: 

IZ1Y2 — L2Yi |v 
max(|21|, [Yilv) max(|2|», (Yale) 


Pv([@1 + yi], [v2 : ye]) = 


It is clear that scaling the coordinates for the two points does not change this value. For con- 
venience, we will often “normalize” coordinates so that x1, y1, V2, y2 € Rand max(|r1|., |yilv) = 
max(|22|,, |yol,) = 1 (i.e. at least one of x1, y; and at least one of x2, y2 is a unit). Then the 
formula becomes 

Po([21 : Yi), [2 : yal) = |t1y2 — Fo¥alo = [Cor v2 DI: 
In particular, 

po([z1 : 1], (0: 1) = |rilv. 

Proposition 42.1.1: p, is a nonarchimedean metric satisfying p,(Pi, P2) < 1 for any P,, Po. 


Proposition 42.1.2: The metric p, is invariant under fractional linear transformations. 
That is, letting 


f(x,y) = 


ax + by ‘ b 


PGL 
cx + dy’ 7 € PGL2(R), 
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we have that 
Pol f (Pi), f(P2)) = pol Pi, Pe). 
Note: for convenience, we will sometimes write f € PGL2(R). 


Proof. Normalize coordinates. Note that |ax; + by; : cx; + dy;| are normalized coordinates 
for f(P;) because multiplying the coordinates by the inverse matrix of (% 5) gives 


d —b ax, + by; _ ri\ 
—c a) \cat+dy)  \yis’ 


since max(|z;|,, |yi|,) = 1 and 2;, y; are R-linear combinations of ax; + by; and cx; + dy;, we 
must have max(|ax; + by;|,) = max(|cx; + dy;|,). 
= |det 6 a 
45 Yi Y2 


Hence 
oo(1(P2), £(Pa) = tee|(% 2) (22 22) 


Proof of|42.1.1(2). We may operate by linear fractional transformations on the points P;, P2, P3 
without changing the values on either side. Hence we make the following reductions. 


= pul Pi, Pa). 


1. Applying f = - as necessary, we can assume |Z], < |yol|, = 1. 


2. Apply f = 2*>* so that |P2| = [0: 1]. (Note (% ~72) € PGLo(R) since |y2|, = 1.) 
The inequality now follows from 


P( Pi, P3) = |x1y3 — silo 
< max{|r1ys|v, |vsyilo} 
max{||p; |r3lo} 

= max{py(Pi, P2), Py( Pe, P3)} since P, = [0: 1D 
Definition 42.1.3: Let (K,|-|) bea field with valuation, and ¢(z) € K(z) be a nonconstant 
rational map. The multiplier of ¢ at a fixed point a € K is 


A | 


If a has exact period n for ¢, then we define 


Aa(d) = ($")'(a) = $'(a)¢'(G(a)) --- o'(G""*(a))- 
(The latter follows by the chain rule.) We say that 


{ superattracting, if \.(¢) = 0 
i attracting, i Agito) < 1 
| neutral, it Awl) = i 
i repelling, Hh el) > dL. 


If X\.(¢) = 1, we say that ¢ is rationally or irrationally neutral according to whether or not 
Ao(¢) is a root of unity. [Analogy with C case?] 
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2 Reduction of maps 


Let K be a field with normalized discrete valuation v, let R be the ring of integers, p the 
maximal ideal, and k = R/p the residue field. Given a point P € P’ (kK), choose coordinates 
[wp : ... : &,] so that 2; € R for all j and at least one x; has valuation 0, and define 


P= Pip ceas Cel: 
We similarly define the reduction of a rational map ¢ as follows: First write (X,Y) = 
[F(X,Y) : G(X,Y)] in normalized form, ie. F,G € R[X,Y] and at least one coefficient of 


F or Gis in R*. Then we let ¢ = [F: Gl. 
Proposition 42.2.1 (Basic properties of reduction): 
1. P, = Pp if and only if py(Pi, P) < 1. 
2. For P,Q € P'(K) and f € PGL2(R), P = Q if and only if f(P) = f(Q). 


3. Let P,, P2, P3 be points with distinct reductions. There exists a fractional linear trans- 
formation f € PGL2(R) such that 


f(Pi) =0, f (Pa) =1, and f(P3) = 00. 
(Note that we can always find f € PGL2(K).) 
Proof. Normalize coordinates. 
1. Suppose x1yY2 = ®2y; (mod p). If x1;x2 #0 (mod p), then 
Py = [B18 : Five) = [Bik : iF] = Pr. 
If 2122 =0 (mod p), then P, = P) = (0: 1]. 
2. Combine part 1 with Proposition [42.1.2] 
3. We build f as a composition of the following. 


(a) Applying f: = + = (4) as necessary, we may assume v(x1) > v(y1). Normalize 
again so u(y,) = 0. 


(b) Apply fe = 445"¥ = (4 #1) so P, = (0: 1] =0. 


0 1 
}, 23), which fixes P; and send P; to [1 : 0] = co. 


( 
(c) Apply fs = ox*ay = 
(d) Apply fa = we = (% ° ) to scale P. (Map in PGL(R)?) 


Define the resultant of ¢ = |F' : G] to be Res(F,G) (see Section [9[5p. Note @ is defined 
up to (R*)*¢ where d = deg ¢. 
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Theorem 42.2.2 (Upper bound on expansion in chordal metric): Let ¢: P'(K) > P!(K). 
Then 
po(P(P1), d(P2)) < | Res(9) |," pu( Pi, Pa). 


Proof. Let {x : y] be normalized. By Proposition 2) (suitably homogenized), there exist 
F,, G1, Fo, G2 such that 


F\F + GG = Res(¢)X74"! 
FLF + GoG = Res(¢)Y"*"*. 
By the triangle inequality, 
| Res(g)X°"|, < max(|F (x,y) |v, |G(x, y) |v) 
| Res(d)¥**" |) < max(|F(x,y)|v, |G(x, y)|v) 
Since max{|z|y, |y|o} = 1, we get 
| Res(#) |, < max(|F'(x, y)lo, |G(z, y) lv) (42.1) 


which bounds the extent to which F(x, y), G(x, y) can both be divisible by high powers of p. 

Take P,, P) to be normalized. We have the factorization 

F(X1,Y1)G(Xo, Yo) — F( Xo, Y2)G(X1, Yi) = (X1Y2 — XeY1) W(X, N%, Xo, Yo). (42.2) 

—>—_—_—_$__E 
ER[X1,Y1,X2,Y2] 
Hence 
F(X1,Y1)G(Xo, Yo) — F(X, Y2)G(X1, Y1)|> 
py(o(P,), 6(P2)) = | ( 1 1) ( 2 2) ( 2 2) ( 1 1)| 
max{|F'(X4, Yi)|v, |G(X1, Yi) |v} max{|F'(X2, Yo) |v, |G(X2, Ya)|o} 
IF (YX, Y{)G(Xo, Y2) = F(X2,Y2)G(X, Yale 


| Res(@)|? 
|(X1Yo — XeVY1)H(X1, Xe, V1, Yale 
7 | Res(@)|> 
Po( Pi, Po) 
~ |Res(@)|2 


Proposition 42.2.3: Let ¢ : P! — P' be defined over K, and write ¢ = [F : G] in 
normalized form. The following are equivalent. 


1. degg= deg d. 
2. F(X,Y) = G(X,Y) =0 has no solutions [a : 6] € P'(k). 
3. Res(¢) € R*. 
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4. Res(F’,G) 40. 
We say that ¢ has good reduction if the above are satisfied. 
Proof. Note that deg ¢ — deg ¢ equals the number of common roots of F = G. This shows 


(1) == (2). Now (2), (3), and (4) are equivalent by applying Proposition 9}9.5.2] to F and 
G. 


Proposition 42.2.4 (Basic facts about reduction): Let ¢,~ : P! — P! be rational maps 
with good reduction. 


1. 6(P) = @(P) for all P € P'(K). 
2. dow has good reduction and dow = dow. 


3. Reduction sends Per(¢) + Per(¢) and PrePer(¢) — PrePer(¢). Moreover it preserves 
exact periods. 


Proof. Use the characterization of good reduction given by Proposition |42.2.3(2). 


Definition 42.2.5: The Fatou set of ¢ is the maximal open set on which {¢” : n € N} is 
equicontinuous. The Julia set is the complement of the Fatou set. 


Theorem 42.2.6: Let ¢: P' > P! be a rational map with good reduction. Then 


1. ¢ is everywhere nonexpanding: 
Pv(O(P1), @(P2)) a Pol Pi Py). 


2. ¢ has empty Julia set. 


Proof. 1. Use Theorem /42.2.2] and the fact that p,(P,, P2) < 1 when P,=P (Proposi- 
tion ??(1)). 


2. A nonexpanding map is equicontinuous with constant 1. 


3 Periodic points 
We now characterize periodic points of ¢. 


Theorem 42.3.1: Let (K,|- |,) be a nonarchimedean local field, k be its residue field, 
and ¢ : P'(K) > P!(K) be a rational function of degree d > 2 with good reduction. Let 
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P € P'(K) be a periodic point of ¢. Let 

n = period of P for @ 

m = period of P for db 

r = order of A3(P) = (™)'(P) in k* 

p= ki. 
Then n =m, or mrp® for some e € No. 
Proof. Replacing ¢ by ¢” and m by 1, we may assume m = 1, i.e. P isa fixed point of d. If 
o(P) = P we are in the first case, so assume this does not happen. We may further assume 
P =(0: 1], by taking f sending [0,1] to P and replacing ¢ with f~!o do f. 


Our main technique is to write the iterates ¢'(0) in terms of ¢(0) by considering the 
Taylor expansion. Write 


= agX4t+++++a9 
0(Z) _ bgt ice dy 
where ps = $° € p (because 0¢(0) = 0) and A = ¢’(0). By induction, we find that 


= ppt dz+-- 


i _ oo i—1 a woes 
d'(z) = w+A+-:-+A Do? ad, er 
¢*(0) (9*)/(0) 
Since @"(0) = 0, this gives 
L+A++:-+A"™t=0 (mod p). (42.3) 
Consider two cases. 
1. X41 (mod p). Then r > 2. Multiplying (42.3) by A — 1 gives \” = 1 (mod p). This 
shows r|n. Ifn #1, then replace ¢ with ¢”. Then A is replaced with A", so we are in 
the second case. 


2. \=1 (mod p). Then (42.3) gives us n = 0 (mod p); hence p | n and we can replace 
@ with @? and n by > Then we are in this case again, and we repeat until n = 1. 


Corollary 42.3.2: Let 6: P' > P! be a rational map with good reduction. 


1. Every periodic point of @ is nonrepelling. 


2. db is separable, then @ has finitely many attracting periodic points. 


Theorem 42.3.3: Let K be number field, and ¢ : P! > P! be a rational map over K. 
Suppose @ has good reduction at p and q, with different residue characteristics. Let P be a 
periodic point with period n. Then 


n < (Np? — 1)(Nq? — 1). 
In particular, Per(¢, K’) is finite for any ¢. 
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Proof. We have that 


mp < |P*(Fp)| =Np +1 
=Np-1 


rps |F; 


and similarly for q. By Theorem |42.3.1| we get 
n= ith = marigh 


for some e, f € {0,1} and e’, f’ € No. Since p,q are relatively prime, n < mprpmqrq, giving 
the desired bound. 

The second part now follows from the fact that the coefficients of @ can have nonzero 
valuation only for a finite number of primes, and the fact that @"(P) = P can only have 
finitely many solutions for a fixed n. 
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